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Abstract. The real trees form a class of metric spaces that extends 
the class of trees with edge lengths by allowing behavior such as infinite 
total edge length and vertices with infinite branching degree. Aldous's 
Brownian continuum random tree, the random tree-like object naturally 
associated with a standard Brownian excursion, may be thought of as a 
random compact real tree. The continuum random tree is a scaling limit 
as A'^ ^ oo of both a critical Galton- Watson tree conditioned to have to- 
tal population size as well as a uniform random rooted combinatorial 
tree with N vertices. The Aldous-Broder algorithm is a Markov chain 
on the space of rooted combinatorial trees with N vertices that has the 
uniform tree as its stationary distribution. We construct and study a 
Markov process on the space of all rooted compact real trees that has 
the continuum random tree as its stationary distribution and arises as 
the scaling limit as TV ^ cx) of the Aldous-Broder chain. A key tech- 
nical ingredient in this work is the use of a pointed Gromov-Hausdorff 
distance to metrize the space of rooted compact real trees. 
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1. Motivation and background 

It is shown in |Ald91al lAldQlbl IAld93| (see also |L(;99[ IPit()2aj ') that a 
suitably rescaled family of Galton- Watson trees, conditioned to have total 
population size n, converges as n ^ oo to the Brownian continuum random 
tree ( CRT), which can be thought of as the tree inside a standard Brown- 
ian excursion (more precisely, twice a standard Brownian excursion if one 
follows Aldous's choice of re-scaling). Aldous describes a procedure for rep- 
resenting trees as closed subsets of £^ , and convergence is here in the sense of 
weak convergence of probability measures on the this space of closed subsets 
equipped with the Hausdorff distance. 

The Brownian CRT can be obtained as an almost sure limit by growing 
finite trees (that is, trees with finitely many leaves and finite total branch 
length) in continuous time as follows (at all times t > the procedure will 
produce a rooted tree IZt with total edge length t): 

• Write Ti , T2 , . . . for the successive arrival times of an inhomogeneous 
Poisson process with arrival rate t at time t > 0. Call r„ the n**^ cut 
time. 

• Start at time with the 1-tree (that is a line segment with two ends), 
7^0) of length zero (T^o is "really" the trivial tree that consists of one 
point only, but thinking this way helps visualize the dynamics more 
clearly for this semi- formal description). Identify one end of TZq as 
the root. 

• Let this line segment grow at unit speed until the first cut time ri. 

• At time ri pick a point uniformly on the segment that has been 
grown so far. Call this point the first cut point. 

• Between time ri and time r2, evolve a tree with 3 ends by letting a 
new branch growing away from the first cut point at unit speed. 

• Proceed inductively: Given the n-tree (that is, a tree with n + 1 
ends), TZr,^-, pick the n-th cut point uniformly on TZr^- to give an 
n + 1-tree, TZt„, with one edge of length zero, and for t G [Tn,Tn+i[, 
let TZt be the tree obtained from TZr„ by letting a branch grow away 
from the n^^ cut point with unit speed. 

The tree T^m- has the same distribution as the subtree of the CRT that 
arises from sampling twice the standard Brownian excursion at n i.i.d. uni- 
form points on the unit interval, and the Brownian CRT is the limit (with 
respect to a suitable notion of convergence) of the increasing family of rooted 
finite trees {TZt)t>o- 

Again using the cut times {Ti,r2, . . .}, a closely related way of growing 
rooted trees {Tt)t>o can be described as follows (here again the tree at time 
t will have total edge length t): 

• Start with the 1-tree (with one end identified as the root and the 
other as a leaf), Tq, of length zero. 

• Let this segment grow at unit speed on the time interval [0,ri[, and 
for t € [0,ri[ let Tt be the rooted 1-tree that has its points labeled 
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by the interval [0, t] in such a way that the root is t and the leaf is 
0. 

• At time ri sample the first cut point uniformly along the tree Tn-, 
prune off the piece of that is above the cut point (that is, prune 
off the interval of points that are further away from the root t than 
the first cut point). 

• Re-graft the pruned segment such that its cut end and the root are 
glued together. Just as we thought of Tq as a tree with two points, 
(a leaf and a root) connected by an edge of length zero, we take 

to be the the rooted 2-tree obtained by "ramifying" the root 
into two points (one of which we keep as the root) that are joined 
by an edge of length zero. 

• Proceed inductively: Given the labeled and rooted n-tree, for 
t G [r„_i,r„[, let Tt be obtained by letting the edge containing the 
root grow at unit speed so that the points in 7j correspond to the 
points in the interval [0, t] with t as the root. At time t„, the n^^ cut 
point is sampled randomly along the edges of the n-tree, and 
the subtree above the cut point (that is the subtree of points further 
away from the root than the cut point) is pruned off and re-grafted 
so that its cut end and the root are glued together. The root is then 
"ramified" as above to give an edge of length zero leading from the 
root to the rest of the tree. 




Figure 1 illustrates how the tree-valued processes {TZt', t > 0) and 
{%\ t > 0) evolve. (The hold dots re-present an edge of length zero, 
while the small dots indicate the position of the cut point that is going 
to show up at the next moment.) 



The link between these two dynamics for growing trees is provided by 
Proposition 14. ![ where we show that '7^„- has the same law as 7^r„- for 
each n. 
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The process {Tt)t>o is clearly a time- homogeneous Markov process and 
we can run its dynamics (which we will refer to as root growth with re- 
grafting) starting with any finite tree. The resulting process evolves via 
alternating deterministic root growth and random jumps due to re-grafting 
and is an example of a piecewise- deterministic Markov process. A general 
framework for such processes was introduced in |Dav84j as an abstraction 
of numerous examples in queueing and control theory, and this line of re- 
search was extensively developed in the subsequent monog raph |Dav93j . A 
more general formulation in terms of martingales and additive function- 
als can be found in |JS96j . Some other appearances of such processes are 
[EP981 ICDPnil ID(:991 1(:ai931 l(;os9n| . We note also that pruning and re- 
grafting operations such as the one we consider here play an important role 
in algorithms that attempt to reconstruct optimal phylogenetic trees from 
data by moving through tree space as part of a hill-climbing or simulated 
annealing procedure (see, for example, |Fel03j 1 . 

The crucial feature of the root growth with re-grafting dynamics is that 
they have a simple projective structure: If one follows the evolution of the 
points in a rooted subtree of the initial tree along with that of the points 
added at later times due to root growth, then these points together form a 
rooted subtree at each period in time and this subtree evolves autonomously 
according to the root growth with re-grafting dynamics. 

The presence of this projective structure suggests that one can make sense 
of the notion of running the root growth with re-grafting dynamics starting 
from an initial "tree" that has exotic behavior such as infinitely many leaves, 
points with infinite branching, and infinite total edge length - provided that 
this "tree" can be written as the increasing limit of a sequence of finite 
trees in some appropriate sense. Moreover, by the remarks above about the 
relationship between the processes TZ and T, this extended process should 
have a stationary distribution that is related to the Brownian CRT, and the 
stationary distribution should be the limiting distribution for the extended 
process starting from any initial state. 

One of our main objectives is to give rigorous statements and proofs of 
these and related facts. 

Once the extended process has been constructed, we gain a new perspec- 
tive on objects such as standard Brownian excursion and the associated 
random triangulation of the circle (see |Ald94a| IAld94b| lAldODj ) . For exam- 
ple, suppose we follow the height (that is, distance from the root) of some 
point in the initial tree. It is clear that this height evolves autonomously as 
a one-dimensional piecewise-deterministic Markov process that: 

• increases linearly at unit speed (due to growth at the root), 

• makes jumps at rate x when it is in state x (due to cut points falling 
on the path that connects the root to the point we are following), 

• jumps from state x to a point that is uniformly distributed on [0, x] 
(due to re-grafting at the root). 
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We call such a process a Rayleigh process because, as we will show in Sec- 
tion IHl this process converges to the standard Rayleigh stationary distribu- 
tion R on given by 

R(]3;,oo[) = e-^'/^ x > 0, 

(thus R is also the distribution of the Euclidean length of a two-dimensional 
standard Gaussian random vector or, up to a scaling constant, the distribu- 
tion of the distance to the closest point to the origin in a standard planar 
Poisson process). Now, if B^^ := {B^; u G [0,1]} is standard Brownian 
excursion and U is an independent uniform random variable on [0, 1], then 
there is a valid sense in which IB^' has the law of the height of a randomly 
sampled leaf of the Brownian CRT, and this accords with the well-known 
result 

(1.1) P{2B^J= edx} = 'R{dx). 

In order to extend the root growth with re-grafting dynamics to infinite 
trees we will need to fix on a suitable class of infinite trees and a means of 
measuring distances between them. Our path to extending the definition 
of a tree to accommodate the "exotic" behaviors mentioned above will be 
the one followed in the so-called T-theory (see |Dre84l DMT96I ITer97p . T- 
theory takes finite trees to be just metric spaces with certain characteristic 
properties and then defines a more general class of tree-like metric spaces 
called real-trees or M-trees. We note that one of the primary impetuses for 
the development of T-theory was to provide mathematical tools for concrete 
problems in the reconstruction of phylogenies. We also note that M-trees 
have been objects of intensive study in geometric group theory (see, for 
example, the surveys |Sha87l IMor921 ISha91l IBes02j and the recent book 
[ChiOlj l. Diffusions on an R-tree were investigated in |EvaOOj . Some of the 
results on the space of M-trees obtained in this paper have already been found 
useful in the study scaling limits of Galton-Watson branching processes in 
|DL(;n4j . 

Once we have an extended notion of trees as just particular abstract met- 
ric spaces (or, more correctly, isometry classes of metric spaces), we need a 
means of assigning a distance between two metric spaces, and this is provided 
by the Gromov-Hausdorff distance. This distance originated in geometry as 
a means of making sense of intuitive notions such as the convergence to 
Euclidean space of a re-scaled integer lattice as the grid size approaches 
zero or the convergence to Euclidean space of a sphere when viewed from 
a fixed point (for example, the North Pole) as the radius approaches infin- 
ity. Our approach is thus rather different to Aldous's in which trees are 
viewed as closed subsets of via a particular choice of embedding and dis- 
tances are measured using the familiar Hausdorff metric. Although our use 
of the Gromov-Hausdorff distance to metrize the space of M-trees turns out 
to quite elegant and easy to work with, it also appears to be rather novel, 
and thus much of our work in this paper is directed towards establishing 
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facts about the structure of this space. However, we think that the resulting 
mathematics is interesting in its own right and potentiahy useful in other 
investigations where trees have hitherto been coded as other objects such as 
paths (for example, |L(T99[lDL(Tn2j ). We remark in passing that the papers 
[Pau89 . Pau8^ are an application of the Gromov-Hausdorff distance to the 
study of M-trees that is quite different to ours. 

We note that there is quite a large literature on other approaches to 
"geometrizing" and "coordinatizing" spaces of trees. The first construction 
of codes for labeled trees without edge-length goes back to 1918: Priifer 
|Priil8j sets up a bijection between labeled trees of size n and the points of 
{1,2, . . . ,71}""^. Phylogenetic trees are identified with points in matching 
polytopes in DH98^, and BHVOl^ equips the space of finite phylogenetic 
trees with a fixed number of leaves with a metric that makes it a cell-complex 
with non-positive curvature. 

The plan of the rest of the paper is as follows. We collect some results 
on the set of isometry classes of rooted compact R-trees and the properties 
of the Gromov-Hausdorff distance in Section [5J We construct the extended 
root growth with re-grafting process in Section |31 via a procedure that is 
roughly analogous to building a discontinuous Markov process in Euclidean 
space as the solution of a stochastic differential equation with respect to a 
sufficiently rich Poisson noise. This approach is particularly well-suited to 
establishing the strong Markov property. In Section |3 we establish the fact 
claimed above that has the same law as 7^r„- for each n. We prove in 
SectionElthat the extended root growth with re-grafting process is recurrent 
and convergent to the continuum random tree stationary distribution. We 
verify that the extended process has a Feller semigroup in Section El and 
show in Section 13 that it is a re-scaling limit of the Markov chain appearing 
in the Aldous-Broder algorithm for simulating a uniform rooted tree on 
some finite number of vertices. We devote Section |H1 to a discussion of the 
Rayleigh process described above. 

2. M-TREES 

2.1. Unrooted trees. A complete metric space {X,d) is said to be an M- 
tree if it satisfies the following axioms: 

Axiom 1 (Unique geodesies) For all x,y G X there exists a unique 
isometric embedding (ji^^y : [0, d(x,y)] X such that (j)x,y{0) = x and 
4>x,y{d{x,y)) = y. 

Axiom 2 (Loop-free) For every injective continuous map : [0,1] ^ X 
one has V([0, 1]) = ./-v-W.^a) ([0, d(^(0), V(l))]). 

We refer the reader to ( 1 1 )re84[ rnT9fil 1 1 ) M T96I ITer97j ) for background on 
M-trees. A particularly useful fact is that a metric space {X, d) is an M-tree 
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if and only if it is complete, path-connected, and satisfies the so-called four 
point condition, that is, 

d{xi,X2) + d{x3,X4) 

(2.1) 

< max{d{xi,X3) + d{x2,X4), d{xi,X4) + d{x2,xs)} 

for all xi, . . . , a;4 G X. 

Recall that the HausdorjJ distance between two subsets Ai, A2 of a metric 
space (X, d) is defined as 

(2.2) dYi{Ai,A2) := inf{e > 0; C [7,(^2) and A2 C [/^(^i)}, 
where 

(2.3) U,{A) := {x G X- d{x, A) < s}. 

Based on this notion of distance between closed sets, we define the Gromov- 
Hausdorff distance, dQii{Xi, X2), between two metric spaces {Xi,dxi) and 
{X2, dx2) as the infimum of dH(^i, -^2) ^ metric spaces X| and X2 that 
are isomorphic to Xi and X2, respectively, and that are subspaces of some 
common metric space Z (compare \V^mm . BH99, BBJOl]). The Gromov- 
Hausdorff distance defines a finite metric on the space of all isometry classes 
of compact metric spaces (see, for example. Theorem 7.3.30 in |BBI01j ). 

Let (T, dcH) be the metric space of isometry classes of compact R-trees 
equipped with dcH- We will elaborate T slightly to incorporate the notion 
of rooted trees, and this latter space of rooted trees will be the state space 
of the Markov process having root growth with re-grafting dynamics that 
we are going to construct. We will be a little loose and sometimes refer to 
an M-tree as an element of T rather than as a class representative of an 
element. 

Remark As we remarked in the Introduction, Aldous's approach to formal- 
izing the intuitive notion of an infinite tree and putting a metric structure 
on the resulting class of objects is to work with particular closed subsets of 
and to measure distances using the Hausdorff metric on closed sets. Seen 
in the light of our approach, Aldous's approach uses the distance between 
two particular representative elements for the isometry classes of a pair of 
trees rather than the two that minimize the Hausdorff distance. In general 
this leads to greater distances and hence a topology that is stronger than 
ours. 

The following results says that, at the very least, T equipped with the 
Gromov-Hausdorff distance is a "reasonable" space on which to do proba- 
bility theory. 

Theorem 1. The metric space (T,(iGH) is complete and separable. 

Before we prove Theorem ^ we point out that a direct application of 
the stated definition of the Gromov-Hausdorff distance requires an optimal 
embedding into a new metric space Z. While this definition is conceptually 
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appealing and builds on the more familiar Hausdorff distance between sets, 
it turns out to often not be so useful for explicit computations in concrete ex- 
amples. A re-formulation of the Gromov-Hausdorff distance is suggested by 
the following observation. Suppose that two spaces {Xi,dxi) and {X2,dx2) 
are close in the Gromov-Hausdorff distance as witnessed by isometric em- 
beddings fi and /2 into some common space Z. The map that associates 
each point in xi G Xi to a point in X2 G X2 such that dz { f i{xi), f2{x2)) is 
minimal should then be close to an isometry onto its image, and a similar 
remark holds with the roles of Xi and X2 reversed. 

In order to quantify the observation of the previous paragraph, we require 
some more notation. A subset 5ft C Xi x X2 is said to be a correspondence 
between sets Xi and X2 if for each xi € Xi there exists at least one X2 € X2 
such that {xi,X2) € and for each 1/2 G X2 there exists at least one 
yi £ Xi such that (^1,2/2) ^ Given metrics dx^ and dx2 on Xi and X2, 
respectively, the distortion of 5ft is defined by 

(2.4) disC^i) := sup{\dxi{xi,yi) - dx2{x2,y2)\; (a^i, X2), (yi, 2/2) G 5ft}. 
Then 

(2.5) dGn{{X,,dx,),{X2,dx2)) = ^ inf dis(5ft), 

where the infimum is taken over all correspondences 3? between Xi and X2 
(see, for example. Theorem 7.3.25 in jBBIOl^ ) . 

The following result is also useful in the proof of Theorem ^ 

Lemma 2.1. The set T of compact M-trees is a closed subset of the space 
of compact metric spaces equipped with the Gromov-Hausdorff distance. 

Proof. It suffices to note that the limit of a sequence in T is path-connected 
(see, for example. Theorem 7.5.1 in lBBIOI) ) and satisfies the four point 
condition 1)2. 1() . (indeed, as remarked after Proposition 7.4.12 in [BIJI0T|. 
there is a "meta-theorem" that if a feature of a compact metric space can 
be formulated as a continuous property of distances among finitely many 
points, then this feature is preserved under Gromov-Hausdorff limits). □ 

Proof of Theorem^ We start by showing separability. Given a compact 
R-tree, T, and e > 0, let Se be a finite e-net in T. For a,b £ T, let 

(2.6) [a,6[:=</.„,fe([0,d(a,6)[) and ]a,b[:= (l)a,b{]0,d{a,b)[) 

be the unique half open and open, respectively, arc between them, and write 
Te for the subtree of T spanned by S^, that is, 

(2.7) Te :=\J [x,y] and dr^ := d\ 

Obviously, Tg is still an e-net for T, and hence dQY{{Tir,T) < dH{T£,T) < e. 

Now each is just a "finite tree with edge- lengths" and can clearly 
be approximated arbitrarily closely in the dcH-nietric by trees with the 
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same tree topology (that is, "shape"), and rational edge-lengths. The set of 
isometry types of finite trees with rational edge-lengths is countable, and so 
(T,dGH) is separable. 

It remains to establish completeness. It suffices by Lemma 12.11 to show 
that any Cauchy sequence in T converges to some compact metric space, or, 
equivalently, any Cauchy sequence in T has a subsequence that converges 
to some metric space. 

Let {Tn)nGN be a Cauchy sequence in T. By Exercise 7.4.14 and The- 
orem 7.4.15 in |BBIfllj . a sufficient condition for this sequence to have a 
subsequential limit is that for every e > there exists a positive number 
N = N(£) such that every T„ contains an e-net of cardinality N. 

Fix e > and no = no(e) such that dGH{Tm,Tn) < e/2 for m,n > uq. 
Let SriQ be a finite (e/2)-net for T^^ of cardinality N. Then by 1)2. 5(1 for 
each n > uq there exists a correspondence between T^g and T„ such that 
dis(5Rri) < £• For each x G T^^, choose fn{x) G Tn such that (x, fn{x)) € "^n- 
Since for any y & Tn with {x,y) £ K„, dT„iy,fnix)) < dis(3f?„), for all 
n> Uq, the set fniSno) is an e-net of cardinality for r„, n > no. □ 



2.2. Unrooted trees with 4 leaves. For the sake of reference and estab- 
lishing some notation, we record here some well-known facts about recon- 
structing trees from a knowledge of the distances between the leaves. We 
remark that the fact that trees can be reconstructed from their collection of 
leaf-to-leaf distances (plus also the leaf-to-root distances for rooted trees) is 
of huge practical importance in so-called distance methods for inferring phy- 
logenetic trees from DNA sequence data, and the added fact that one can 
build such trees by building subtrees for each collection of four leaves is the 
starting point for the sub-class of distance methods called quartet methods. 
We refer the reader to |Fein3| ISSOSj for an extensive description of these 
techniques and their underlying theory. 

Lemma 2.2. The isometry class of an unrooted tree (T, d) with four leaves 
is uniquely determined by the distances between the leaves ofT. 

Proof. Let {xi, X2, X3, X4} be the set of leaves of T. The tree T has one of 
four possible shapes: 




Figure 2 shows the 4 different shapes of a labeled tree with 4 leaves. 
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Consider case (/), and let yi^2 be the uniquely determined branch point 
on the tree that lies on the arcs [xi, X2] and [xi, 2:3], and 2/3,4 be the uniquely 
determined branch point on the tree that lies on the arcs [2:3, 2:4] and [xi, X3]. 
Observe that 

d{xi,yi^2) = ^(ci(xi, X2) + X3) - d(x2, X3)) 
d{x2,yi,2) = ^{d{xi,X2) + d{x2,X3) - d{xi,X3)) 
(2.8) d{x3,y3^4) = ^(d(x3, X4) + d(xi, X3) - d(xi, X4)) 
d{x4,y3A) = ^id{x3,X4) + d(xi, X4) - d(xi, X3)) 

rf(?/l,2, 2/3,4) = ^(C^(a;i, X4) + d{x2,X3) - d(xi, X2) - d{x3, X4)), 

Similar observations for the other cases show that if we know the shape of 
the tree, then we can determine its edge-lengths from leaf-to-leaf distances. 
Note also that 

X(I){T) := \ (d(xi,X3) + d(x2,X4) - d(xi,X2) - C?(X3,X4)) 



(2.9) 



2 

> for shape (I), 

< for shape (II), 

= for shapes (III) and (IV) 



This and analogous inequalities for the quantities that reconstruct the 
length of the "internal" edge in shapes {II) and (HI), respectively, show 
that the shape of the tree can also be reconstructed from leaf-to-leaf dis- 
tances. □ 

2.3. Rooted M-trees. Since we are mainly interested in rooted trees, we 
extend our definition as follows: A rooted M-iree, {X,d,p), is an M-tree 
(X, d) with a distinguished point p € X that we call the root. It is helpful 
to use genealogical terminology and think of p as a common ancestor and 
/i(x) := d{p, x) as the real-valued generation to which x G X belongs (/i(x) 
is also called the height of x) . We define a partial order < on X by declaring 
(using the notation introduced in (|2.6j) ) that x < y if x £ [p,y], so that x 
is an ancestor of y. Each pair x,y (z X has a well-defined greatest common 
lower bound, x Ay, in this partial order that we think of as the most recent 
common ancestor of x and y. 

Let T"^""* denote the collection of all root-invariant isometry classes of 
rooted compact M-trees, where we define a root-invariant isometry to be an 
isometry ^ : (Xi,dxi,Pi) ^ (-^2,(^X2,^2) with ^{pi) = p2. 

We want to equip T'^°°'' with a Gromov-Hausdorff type distance that 
incorporates the special status of the root. We define the rooted Gromov- 
Hausdorff distance, dQ^Toot{{Xi, pi), {X2, P2)), between two rooted M-trees 
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{Xi,pi) and {X2,P2) as the infimum of dii{X[, X2) V dz{p'i,P2) over all 
rooted M-trees {X'i,pi) and (^2'/'2) that are root-invariant isomorphic to 
{Xi,pi) and {X2,P2), respectively, and that are (as unrooted trees) sub- 
spaces of a common metric space {Z,dz)- 

As in (|2.5|) . we can compute dQ^root((Xi, dx^, pi), {X2, dx2i P2)) by com- 
paring distances within Xi to distances within X2, provided that the dis- 
tinguished status of the root is respected. 

Lemma 2.3. For two rooted trees {Xi,dxi, pi), and {X2,dx2: P2), 

(2.10) dGH--((Xi,dxi,pi),(X2,dx2,P2)) = I inf dis(3f?™°*), 

where now the infimum is taken over all correspondences between Xi 

andX2 with {pi,p2) G 3f?™°*. 



Proof. Indeed, for any root-invariant isometric copies {X[, p[) and {X2, P2) 
embedded in Z, and r > dQ^root{(Xi, pi), (X2,P2)), 

(2.11) := {{x,,X2y, xi G X[,X2 G X^, dz(xi,X2) < r} 

gives a correspondence between Xi and X2 containing {pi,p2) such that 
dis(sft™°*) < 2r. 

On the other hand, given a correspondence between Xi and X2 

containing {pi,p2), define a metric dxiUX2 on the disjoint union Xi U X2 
such that the restriction of dxiUX2 to Xi is dxj, for i = 1,2, and for xi € Xi, 
X2 G X2, by 
(2.12) 

dxiUX2(a;i,a;2) 

:=inf{dxi(xi,yi) + dx,(x2,y2) + ^dis(5R-°*) : (yi, ^2) G 

- in particular, if the pair {xi,X2) actually belongs to the correspondence 
sjjroot^ then dxiUX2{xi,X2) = ^dis{^'^°°^). We leave it to the reader to 
check that dxiUX2 indeed, a metric. Then (computing Hausdorff distance 
within Xi 11 X2 using dxiUX2) have 

(2.13) dn{Xi,X2)\/dx,ux2{pi,P2) < ^dis{W°°'). 

□ 

We state an analogue of Theorem ^ for rooted compact M-trees. 
Theorem 2. The metric space {T^°"^ ,dQfjroot) is complete and separable. 

Before we can prove Theorem|2lwe need two preparatory results. The first 
is a the counterpart of Corollary 7.3.28 in |BB1Q1| and presents convenient 
upper and lower estimates for dQjjroot that differ by a multiplicative constant. 

Let {Xi,pi) and (^2,^2) be two rooted compact M-trees, and take e > 0. 
A map / is called a root-invariant e-isometry from {Xi,pi) to {X2,P2) if 
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f{pi) = P2, dis(/) := sup{\dx^{x,y) - dx2{f{x)J{v))\ ■ x,y £ Xi} < e and 
f{Xi) is an e-net for X2. 

Lemma 2.4. Let {Xi,pi) and (^2,^2) be two rooted compact M-trees, and 
take e > 0. Then the following hold. 

(i) // dQjjroot ((Xi, pi), (X2, P2)) < £; then there exists a root-invariant 
2e-isometry from {Xi,pi) to {X2,P2)- 

(ii) If there exists a root-invariant s-isometry from {Xi,pi) to {X2,P2), 
then 

3 

dQ^root{{Xi,pi),{X2,p2)) < -£• 

Proof, (i) Let (iQjjroot((Xi,pi), (X2,/J2)) < £• By Lemma |2.3I there exists 
a correspondence 3?™°* between Xi and X2 such that {pi, P2) G 
and dis(3?™°*) < 2e. Define / : ^ X2 by setting f{pi) = p2, 
and choosing f{x) such that {x,f{x)) G 3??''°°* for ah € Xi \ {pi}. 
Clearly, dis(/) < dis(5R™°*) < 2e. To see that f{Xi) is an 2e-net for 
X2, let X2 G ^2, and choose Xi £ Xi such that (xi,X2) G K''""*. Then 
dx,{f {xi),X2) < +dis(3?-°*) < 2e. 

(ii) Let / be a root-invariant e-isometry from {Xi,pi) to {X2,P2)- Define 
a correspondence 3?^°°* C x X2 by 

(2.14) := {{xi,X2) : dx,{x2, f{xi)) < e}. 

Then {pi,p2) € 3?™°* and 3?™°* is indeed a correspondence since f{Xi) is a 
e-net for X2. If (xi,X2), (2/1,1/2) S 3?}°°*, then 

Mjs:i(2;i,yi) - (ix2(a;2,y2)| < \dx2{f(.xi), f{yi)) - dx^{xi,yi)\ 

(2.15) + (ix2(2;2,/(2;i)) + dx2{f{xi),y2) 

< 3e. 

Hence dis(5ft}°°*) < 3e and, by dTTTl . ^gh-"' ((^1, Pi), (^2,/02)) < fe- □ 

The second preparatory result we need is the following compactness cri- 
terion, which is the analogue of Theorem 7.4.15 in |BBI01j (note also Exer- 
cise 7.4.14 in |BBI01j l and can be proved the same way, using Lemma 12.41 
in place of Corollary 7.3.28 in |BBI01j and noting that the analogue of 
Lemma O holds for T™°*. 

Lemma 2.5. A subset T C T''°°* is pre-compact if for every e > there 
exists a positive integer N{e) such that each T £ T has an e-net with at 
most N{e) points. 

Proof of Theorem\^ The proof follows very much the same lines as that 
of Theorem The proof of separability is almost identical. The key step 
in establishing completeness is again to show that a Cauchy sequence in 
rproot j^g^g ^ subsequential limit. This can be shown in the same manner as 
in the proof of Theorem with an appeal to Lemma 12.51 replacing one to 
Theorem 7.4.15 and Exercise 7.4.14 in |BBinij . □ 
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2.4. Length measure. Recall that the root growth with re-grafting dy- 
namics involve points being chosen uniformly at random on a finite tree. In 
order to extend the dynamics to general rooted compact R-trees, we will 
require the fact that rooted compact M-trees are associated with a natural 
length measure as follows. Fix (T, d, p) € x™"*, and denote the Borel-o"- field 
on T by B(T). For a,b £ T, recall the half open arc [a, b[ from (|2.6|) . and let 



the skeleton of T. Observe that if T' C T is a dense countable set, then 
(torn) holds with T replaced by T' . In particular, T° G B{T) and B{T)\^, = 
C7({]a,6[; a,b G T'}), where B{T)\^, := {Ar\T°; A G B{T)}. Hence there 
exist a unique u-finite measure fi on T, called length measure, such that 
/i(r \ T°) = and 

(2.17) ^i{]a,b[) = d{a,b), ya,beT. 

In particular, /x is the trace onto T" of one-dimensional Hausdorff measure 



Remark The terminology skeleton might seem somewhat derisory, since 
for finite trees the difference between the skeleton and the whole tree is just 
a finite number of points. However, it is not difficult to produce M-trees 
for which the difference between the skeleton and the whole tree is a set 
with Hausdorff dimension greater than one (the Brownian CRT will almost 
surely be such a tree). This explains our requirement that /x is carried by 
the skeleton. 

Remark Elements of T''°°* are really equivalence classes of trees rather 
than trees themselves, so what we are describing here is a way of associating 
a measure to each element of the equivalence class. However, this procedure 
respects the equivalence relation in that if T' and T" are two representatives 
of the same equivalence class and are related by a root-invariant isometry 
f : T' ^ T" , then the associated length measures /x' and ^u" are such that ^u" 
is the push-forward of /i' by / and p,' is the push-forward of /x" by the inverse 
of / (that is, n"{A") = fj.'{f-\A")) and fi'{A') = /x"(/(A')) for Borel sets 
A' and A" of T' and T" , respectively). 

2.5. Rooted subtrees and trimming. Recall from the Introduction that 
our strategy for extending the root growth with re-grafting dynamics from 
finite trees will involve a limiting procedure in which a general rooted com- 
pact M-tree is approximated "from the inside" by an increasing sequence 
of finite subtrees. We therefore need to establish some facts about such 
approximations . 

To begin with, we require a notation for one tree being a subtree of an- 
other, with both trees sharing the same root. We need to incorporate the fact 



(2.16) 




on T. 
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that we are dealing with equivalence classes of trees rather than trees them- 
selves. A rooted subtree of (T, d, p) G T''°°* is an element (T*,d*,p*),e T''°°* 
that has a class representative that is a subspace of a class representative 
of {T,d,p), with the two roots coincident. Equivalently, any class repre- 
sentative of {T*,d*,p*) can be isometrically embedded into any class rep- 
resentative of (T, d, p) via an isometry that maps roots to roots. We write 
T* T and note that is an partial order on T™°*. 

All of the "wildness" in a compact M-tree happens "at the leaves" . For 
example, if T € T''°°*' has a point x at which infinite branching occurs (so 
that the removal of x would disconnect T into infinitely many components), 
then any open neighborhood of x must contain infinitely many leaves, while 
for each r/ > there are only finitely many leaves y such that x € [p, y\ 
with d{x^ y) > rj. A natural way in which to produce a finite subtree that 
approximates a given tree is thus to fix r/ > and trim off the fringe of the 
tree by removing those points that are not at least distance ry from at least 
one leaf. Formally, for r/ > define Rj^ : T''°°* j;'coot ^^lat 
assigns to (T, p) € T™"* the rooted subtree {Rrj{T), p) that consists of p and 
points a ^ T for which the subtree 

(2.18) 5^''' := {x G T : a G [p,x[} 

(that is, the subtree above a) has height greater than or equal to rj. Equiva- 
lently, 

(2.19) R^{T) ■.= {xeT:3yeTxe[p,y], drix, y) > rj} U {p}. 

In particular, if T has height at most r], then Rri{T) is just the trivial tree 
consisting of the root p. 

Remark Notice that the map described in ()2.19|) maps a metric space into 
a sub-space. However, since isometric spaces are mapped into isometric 
sub-spaces, we may think of i?„ as a map from T^°°* into T™°*. 



Lemma 2.6. (i) The range of Rrj consists of finite rooted trees. 

(ii) The map i?^ is continuous. 

(iii) The family of maps {Rr])ri>o is a semigroup; that is, R^i o R^n = 
Rr,'+v" for V',V" > 0. In particular, Rr,'{T) Rr,"{T) for rj' > 
ri" > 0. 

(iv) For any {T,p) G T™°S d^n^oo^iiT, p), {Rr,{T), p)) < dn{T,R.,{T)) < 
V- 

Proof (i) Fix {T,d,p) G T''""*. Let E C Rr,{T) be the leaves of Rr,, that 
is, the points that have no subtree above them. We have to show that E 
is finite. However, if ai, 02, . . . are infinitely many points in ii^ \ {p}, then 
we can find points 5i , 62 > • • • in T such that hi is in the subtree above aj 
and d{ai,bi) > rj. It follows that mii-/,j d{bi,bj) > 2rj, which contradicts the 
compactness of T. 
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(ii) Suppose that {T',d',p') and {T" ,d" , p") are two compact trees with 

dG^.oo.{{T',p'),{T",p"))<e. 

By part (i) of Lemma |2 .41 there exists a root-invariant 2e-isometry f : T' ^ 
T". Recall that this means, f{p') = p" , dis(/) < 2e, and f{T') is an 2e-net 
for T". 

For a € Rrj(T'), let /(a) be the unique point in Rrj{T") that is closest 
to /(a). We will show that / : Rrj{T') Rj^(T") is a root-invariant 25e- 
isometry and hence, by part (ii) of Lemma 12.41 d^jjioot (ii,y(r'), i?^(T")) < 
p5e. 

We first show that 

(2.20) snp{d"{f{a), f{a)) : a € R,{T')} < 8e. 

Fix a £ Rrj{T') and let 6 G T' be a point in the subtree above a such that 
d'{a,b) > rj. Denote the most recent common ancestor of f{a) and f{b) on 
T" by /(a) A" f{b). 

f b 

' a fia) A" f{b) 

. P' 

Figure 3 illustrates the shapes of the trees spanned by {p',a,b} and by 
{p" , f (a) , f (b)} . The point f{a) lies somewhere on the arc [p",f{a)]. 

Then 

d"(/(a) A" fib), f (a)) 

= \ id" if (a), fib)) + d"ip", fia)) - d"{p", fib))) 

(2.21) <^i\d"ifia),fib))-d'ia,b)\ 

+ \d"ip",fia)) - d'ip',a)\ + \d"ip",fib)) - d'ip',b)\) 

< 3e. 

If fia) £ [fia) a" fib), fia)] then we are immediately done. Otherwise, 
fia) e [p",f ia)] and /(a) is a leaf in R^iT"). Hence fib) ^ Rr,iT"), and 
therefore 



(2.22) 



d"ifia),fib))<rj. 
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Furthermore, 

d!' ifia) A"f{b),fib)) = d" if (a), fib)) - d" (/(a) A7(6),/(a)) 

(2.23) > id'ia,b) -2e) -3e 

> 7] — 5s. 

Combining (IT^ . (fT^ and finally yields that (i"(/(a), /(a)) < 8e 

and completes the proof of (|2.20() . 
It follows from ^^TM that 

dis(/) = sup{|d'(a,6) - d"ifia),fib))\ : a,b e i?,(r')} 
< sup{|d'(a,6) - d"ifia)Jib))\ : a,b e i?,(r')} 

+ 2supK(/(a),/(a)) :aGi?„(^')} 
<2e + 2x8e 
= 18e. 

The proof of (ii) will thus be completed if we can show that /(i?,j(T')) is 
a 25e-net in i?^(r"). Consider a point c G Rr^iT"). We need to show that 
there is a point 6 G Rr^iT') such that 

(2.24) <i"(/(6),c) <25e. 

If d"ip'\c) < 7e, then we are done, because we can take b = p' (recall 
that fip') = p"). Assume, therefore, that d"ip",c) > 7e. We can then 
find points c_ , c+ G T" such that p" < < c < c+ with d" (c_ , c) = 7e 
and d"(c, C4.) > r?. There are corresponding points a_,a, a+ G T' such that 
d"(/(a_),c„) < 2e, d"(/(a),c) < 2e, and d"(/(a+),c+) < 2e. We claim 
that b := a- a' (the most recent common ancestor of a_ and a_|- in the 
tree T') belongs to Rr,iT') and satisfies ^TM . 
Note first of all that 

d'ib, a+) = d'ia- a' a+, a+) 

= ^ (d'(a+,a_) + d'(p',a+) - d'ip,a^)) 

> \ (d"(/(a+), /(a_)) - 2e + d"(/(pO, /(a+)) - 2s 
-d"(/(p'),/(a-))-2e) 

> ^ (d"(c+, C-) - 4e + (i"(/, c+) -2e- dl'ip",^.) - 2e) - 3e 

= d"(c+,c_) -7e 
= r/ + 7e - 7e 

and so 6 G i?^(T'). 
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Furthermore, 

d"ic,fib)) < d"ic,c^) + d"{c.,f{a.)) + d"if{a^)J{b)) 

< 7e + 2e + d'{a^,b) + 2e 

= lle + ^ (ci'(a+,a_) + d'{p',a-) -d'{p',a^)) 

< lie + ^ (d"(/(a+), /(a^)) + 2e + d"{fip'), /(a_)) + 2e 
-d'{f{p')J{a+)) + 2e) 

< 14e + ^ (d"(c+, c_) + 2e + c_) + 2e - c+) + 2e) 
= lie. 

Therefore, by ^HTM . 

d{cj{h)) < ne + Se = 25e. 

This completes the proof of ()2.24|) . and thus the proof of part (ii). 

Claims (iii) and (iv) are clear. □ 

Finally, we require the following result, which will be the key to showing 
that the "projective limit" of a consistent family of tree- valued processes 
can actually be thought of as a tree- valued process in its own right. 

Lemma 2.7. Consider a sequence {Tn)n&i of representatives of isometry 
classes of rooted compact trees in (T,dQjjroot) with the following properties. 

• Each set Tn is a subset of some common set U. 

• Each tree Tn has the same root p ^ U . 

• The sequence {Tn)neN is nondecreasing, that is, Ti C r2 C • • • C [/. 

• Writing dn for the metric on Tn, for m < n the restriction of dn 
to Tm coincides with dm, so that there is a well-defined metric on 
T ■■= UneN^n aiven by 

(2.25) d{a,b) = dn{a,b), a,beTn. 

• The sequence of subsets {Tn)neN is Cauchy in the Hausdorff distance 
with respect to d. 

Then the following hold. 

(i) The metric completion TofTisa compact M-tree, and d}i{Tn,T) — > 
as n ^ oo, where the Hausdorff distance is computed with respect 
to the extension of d to T. In particular, 

(2.26) lim d^^.oo,{{Tn,p), {T,p)) = 0. 

n — ^oo 

(ii) The tree T has skeleton T° = IJneN^n- 

(iii) The length measure on T is the unique measure concentrated on 
UnsN-^?? ^^^^ restricts to the length measure on Tn for each n E N. 
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Proof, (i) Because T is a complete metric space, the collection of closed 
subsets of T equipped with the Hausdorff distance is also complete (see, 
for example, Proposition 7.3.7 of BBIOl ). Therefore the Cauchy sequence 
{Tn)neN has a limit that is (see Exercise 7.3.4 of |BBinij ) the closure of 
UfceN itself. It is clear that the complete space T is totally bounded, 

path-connected, and satisfies the four point condition, and so T is a compact 
M-tree. Finally, 

(2.27) dQ^root{{Tn,p),{f,p))<dn{Tn,f)yd{p,p)=dn{Tn,f)^0, 
as n ^ oo. 

Claims (ii) and (iii) are obvious. □ 

3. Root growth with re-grafting 

3.1. Beginning the construction. We are now ready to begin in earnest 
the construction of the T™°*-valued Markov process, X, having the root 
growth with re-grafting dynamics. 

Fix a tree {T,d,p) G T™°*. This tree will be the initial state of X. We 
first recapitulate the strategy outlined in the Introduction. In line with that 
semi-formal description, the "stochastic inputs" to the construction of X 
will be a collection of cut times and a corresponding collection of cut points. 

• Construct simultaneously for each finite rooted subtree T* T a 
process X-^* with Xq* = T* that evolves according the root growth 
with re-grafting dynamics. 

• Carry out this construction in such a way that if T* and T** are two 
finite subtrees with T* T**, then Xj"* X]^** and the cut 
points for X"^ are those for X^** that happen to fall on for a 
corresponding cut time r of X^**. Cut times r for X^** for which 
the corresponding cut point does not fall on Xj_! are not cut times 
for X'^* . 

• The tree (T, p) is a rooted Gromov-Hausdorff limit of finite IR-trees 
with root p (indeed, any subtree spanned by a finite e-net and p is 
finite and has rooted Gromov-Hausdorff distance less than e from 
(T,p)). In particular, {T,p) is the "smallest" rooted compact R-tree 
that contains all of the finite rooted subtrees of {T,p). 

• Because of the consistent projective nature of the construction, we 
can define Xt := Xj' for t > as the "smallest" element of T'"""* 
that contains , for ah finite trees T* T. 

It will be convenient for establishing features of the process X such as the 
strong Markov property to introduce randomness later and work initially in 
a setting where the cut times and cut points are fixed. There are two types 
of cut points: those that occur at points which were present in the initial tree 
T and those that occur at points which were added due to subsequent root 
growth. Accordingly, we consider two countable subsets ttq C M"*"^ x T° and 
TT C {(t, x) € X : X < t}. (Once again we note that we are moving 
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backwards and forwards between thinking of T as a metric space or as an 
equivalence class of metric spaces. As we have written things here, we are 
thinking of ttq being associated with a particular class representative, but 
of course ttq corresponds to a similar set for any representative of the same 
equivalence class by mapping across using the appropriate root invariant 
isometry.) 

Assumption 3.1. Suppose that the sets ttq and tt have the following prop- 
erties. 

(a) For all to > 0, each of the sets 7ron{{to} xT°) and irn{{to}x]0,to]) 
has at most one point and at least one of these sets is empty. 

(b) For all to > and all finite subtrees T' C T, the set ttq H (]0, to] ^ T') 
is finite. 

(c) For all to > 0, the setirn{{t,x) G M++ x M++ : x < t < to} is finite. 



Remark Conditions (a)-(c) of Assumption |S3 will hold almost surely if ttq 
and vr are realizations of Poisson point processes with respective intensities 
A ® /i and X0 X (where A is Lebesgue measure) , and it is this random mech- 
anism that we will introduce later to produce a stochastic process having 
the root growth with re-grafting dynamics. 

It will be convenient to use the notations ttq and it to also refer to the 
integer-valued measures that are obtained by placing a unit point mass at 
each point of the corresponding set. 

Consider a finite rooted subtree T* T. It will avoid annoying cir- 

cumlocutions about equivalence via root invariant isometries if we work with 
particular class representatives for T* and T, and, moreover, suppose that 
T* is embedded in T. 

Put Tq := 0, and let < < r| < . . . (the cut times for X'^*) be the 
points of {t > : 7ro({t} x T*) > 0} U {t > : 7r({t} x M++) > 0}. 

An explicit construction of Xf* is then given in two steps: 

Step 1 (Root growth) At any time t > 0, Xj" as a set is given by the 
disjoint union r*n]0,t]. The root of Xf* is the point pt := t £]0,t]. The 
metric dj on Xf is defined inductively as follows. Set to be the metric 
on Xj* = T*; that is, dg* is the restriction of d to T*. Suppose that dJ* 
has been defined for < t < r*. Define dJ for t* < t < r*^_^ by 

(dr*{a,b), ifa,6GX^*, 

(3.1) dr{a,b) := <\b-a\, if a,b e]T*"t], 

Ja-T*\+dr*iPr:i,b), if a e]T*,t], b e X^: . 

Step 2 (Re- Grafting) Note that the left-limit X'E* _ exists in the rooted 
Gromov-Hausdorff metric. As a set this left-limit is the disjoint union 

(3.2) x5*u]<,<+i] =r*n]o,<+i], 
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and the corresponding metric dr* - is given by a prescription similar to 

(EH)- 

Define tlie (n + 1)*^* cut point for X'^* by 



(3.3) 



\aeT*, if^o({«+i,a)}) >0, 

i^G]0,r*+i], if^({(r*^i,x)})>0. 

Let 5* , 1 be the subtree above p* , i in X-^,* , that is, 

(3.4) 5:+i := {b G : K+i ^ 

Define the metric d-r* by 

(3.5) 

dr„v,-(«,^), ifa,6GX^;^\5*+i, 
In other words is obtained from Xj, by pruning off the subtree 

^n+l ^n + 1 

5*^_]^ and re-attaching it to the root. 

Now consider two other finite, rooted subtrees {T** , p) and {T*** , p) of T 
such that T* U T** C T*** (with induced metrics). Build X'^" and X^"* 
from ttq and vr in the same manner as X'^* (but starting at T** and T***). 
It is clear from the construction that: 

• Xj"* and Xf** are rooted subtrees of xj"*** for all t > 0, 

• the Hausdorff distance between Xf* and Xf" as subsets of xj""" 
does not depend on T***, 

• the Hausdorff distance is constant between jumps of X^ and X'^" 
(when only root growth is occurring in both processes). 

The following lemma shows that the Hausdorff distance between Xf* and 
xf** as subsets of xf*** does not increase at jump times. 

Lemma 3.2. Let T be a finite rooted tree with root p and metric d, and 
let T' and T" be two rooted subtrees of T (both with the induced metrics and 
root p). Fix p G T, and let S be the subtree in T above p (recall i^-4\ ))- 
Define a new metric d on T by putting 

d{a, 6), i/ a, 6 G S, 

d{a,b) := <d{a,b), ifa,beT\S, 

d{a,p) + d{p,b), ifa&S,b&T\S. 

Then the sets T' and T" are also subtrees of T equipped with the induced 
metric d, and the Hausdorff distance between T' and T" with respect to d is 
not greater than that with respect to d. 



22 



STEVEN N. EVANS, JIM PITMAN, AND ANITA WINTER 



Proof. Suppose that the Hausdorff distance between T' and T" under d is 
less than some given e > 0. Given a £ T' , there then exists b G T" such that 
d{a,b) < e. Because d{a,a A 6) < d{a,b) and a A 6 G T", we may suppose 
(by replacing 6 by a A 6 if necessary) that b < a. We claim that d{a, c) < e 
for some c (zT" . This and the analogous result with the roles of T' and T" 
interchanged will establish the result. 

If a,b £ S OT a,b £ T \ S, then d{a, b) = d{a, b) < e. The only other 
possibility is that a € S and 6 G T \ 5, in which case p £ [b, a] (for T 
equipped with d). Then d{a, p) = d{a,p) < d{a,b) < e, as required (because 
p £ T"). □ 

Now let Ti C T2 C • • • be an increasing sequence of finite subtrees of 
T such that UneN^n dense in T. Thus linin^oo dH{Tn,T) = 0. Let 
X^,X'^, ... be constructed from ttq and vr starting with Ti, T2, . . .. Applying 
Lemma 13.21 yields 

(3.6) lim sup dniiroot{X^,Xj') = 0. 

m,n— >oo 

Hence by completeness of T''°°*, there exists a cadlag T''°°*-valued process 
X such that Xq = T and 

(3.7) lim sup dn^root{X]^,Xt) = 0. 

m^°o t>0 

A priori, the process X could depend on the choice of the approximating 
sequence of trees (Tn)ni=N- To see that this is not so, consider two approxi- 
mating sequences T^^ C C • • • and C T| C • • • . For k £ N, write 
for the smallest rooted subtree of T that contains both and T^. As a set, 
= T^L) T^. Now let ((X"'*)j>o)neN for i = 1,2,3 be the corresponding 
sequences of finite tree-value processes and let {X^'^)t>o for i = 1,2,3 be 
the corresponding limit processes. By Lemma 13.21 



(3.8) 



<(^H(x^'^x^'^)+(^H(x^•^x^'^) 

<dH(ri,r3) + <iH(T2,r3) 
<dH(T;\T) + dH(r2,r) ^0 



as n — > 00. Thus, for each t > the sequences {X^'^)nen and (X"'^)neN 
do indeed have the same rooted Gromov-Hausdorff limit and the process X 
does not depend on the choice of approximating sequence for the initial tree 
T. 

3.2. Finishing the construction. In 13.11 we constructed a T'^°°*-valued 
function t ^ Xt starting with a fixed triple (T, ttq , tt) , where T £ T™"* and 
7ro,7r satisfy the conditions of Assumption 13. 11 We now want to think of X 
as a function of time and such triples. 
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Let i}* be the set of triples (T, 7ro,7r), where T is a rooted compact M- 
tree (that is, a class representative of an element of T™"*) and vro,7r satisfy 
Assumption 13.11 

The root invariant isometry equivalence relation on rooted compact M- 
trees extends naturally to an equivalence relation on Q* by declaring that 
two triples {T',ttq,-k') and (T", ttq , vr"), where ttq = {{a^,x[) : i G N} and 
ttq = {((t",x'/) : i G N}, are equivalent if there is a root invariant isometry 
/ mapping T' to T" and a permutation 7 of N such that a'- = cr^j-j^ and 

x'l = /(a;!y(j)) for all i G N. We write Q, for the resulting quotient space of 
equivalence classes. 

In order to do probability, we require that O has a suitable measurable 
structure. We could do this by specifying a metric on Q, but the following 
approach is a little less cumbersome and suffices for our needs. 

Let Q,^^ denote the subset of Q consisting of triples (T, ttq, vr) such that T, 
ttq and vr are finite. We are going to define a metric on Q^^. Let (T'jTTqjTt') 
and (T",7rQ,7r") be two points in Q^^, where vtq = {{a[, x'l), . . . ,{ap,Xp)}, 
tt' = {r( , . . . , r;}, < = {«, x'l), . . . , «, x',')}, and vr" = {rf , . . . , r'J}. As- 
sume that < a[ < ■ ■ ■ < a'p, < t[ < ■ ■ ■ < T^, < a" < ■ ■ ■ < a'q, and 
< r(' < • • • < The distance between (r',vr^,vr') and (T", vr^', vr") will 
be 1 if either p ^ q ox r ^ s. Otherwise, the distance is 

(3.9) lA^^ inf dis(r°°*'""*")+max|CT^-a^'I + max|T'-r'' 

\ 2 3^root,cuts ^ j 

where the infimum is over all correspondences between T' and T" that con- 
tain the pairs {pt'^Pt") and (x'j,x") for 1 < z < p. 

Equip with the Borel cr-field corresponding to this metric. For t > 0, 
let J-° be the a-field on O generated by the family of maps from $7 into Q,^'^ 
given by (r,vro,vr) ^ (i?^(r), vro n (]0, i] x (i?^(r))°), vrn {(s, x) : x < s < t}) 
for 77 > 0. As usual, set := n„>t for t > 0. Put T° := Vt>o ^t- 

It is straightforward to establish the following result from Lemma [2. 61 and 
the construction of X in Subsection 13.11 and we omit the proof. 

Lemma 3.3. The map (t, (T, vro, vr)) Xt{T, vtq, vr) from R+ xU into T™°* 
is progressively measurable with respect to the filtration {T°)t>Q- (Here, of 
course, we are equipping T™°* with the Borel a-field associated with the 
metric c^q jjroot.j 

Given T G T™°*, let P"^ be the probability measure on O defined by the 
following requirements. 

• The measure P"^ assigns all of its mass to the set {(T', vtq, vr') G ft : 
T = T}. 

• Under P^, the random variable (T',vrQ,vr') 1— > vrg is a Poisson point 
process on the set M"*"^ x T° with intensity A <8> /x, where p is the 
length measure on T. 
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• Under P"^, the random variable {T',ttq,tt') i-^ vr' is a Poisson point 
process on the set {(t, x) E M"'""'' x M"*""*" : x <t} with intensity A (8) A 
restricted to this set. 

• The random variables (T',7rQ,7r') i— > vtq and (T',7rQ,7r') i— > vr' are 
independent under P"^. 

Of course, the random variable (r',7rQ,7r') i— > ttq takes values in a space of 
equivalence classes of countable sets rather than a space of sets per se, so, 
more formally, this random variable has the law of the image of a Poisson 
process on an arbitrary class representative under the appropriate quotient 
map. 

For t > 0, 5 a bounded Borel function on T™°*, and T G T''""*, set 

(3.10) Ptg{T):=P''[g{Xt)]. 

With a slight abuse of notation, let ii^ for 77 > also denote the map from 
n into n that sends (r,7ro,7r) to (i?,,(r),7ro n (M++ x (R;,(r))°), vr). 
Our main construction result is the following. 

Theorem 3. (i) // T G T''°°* is finite, then {Xt)t>o under P^ is a 
Markov process that evolves via the root growth with re- grafting dy- 
namics on finite trees. 

(ii) For all rj > and T G T"^""*, the law of {Xt o Rr,)t>o under P^ 
coincides with the law of {Xt)t>o under p^vC^) _ 

(iii) For allT G T™°*, the law of {Xt)t>o under P^'''^'^^ converges as rj I 
to that of {Xt)t>o under P^ (in the sense of convergence of laws on 
the space of cadlag T^°°^ -valued paths equipped with the Skorohod 
topology). 

(iv) For g G bi3(T™°*), the map {t,T) ^ Ptg{T) is S(R+) x B{T'°°^)- 
measurable. 

(v) The process {Xt,P'^) is strong Markov with respect to the filtration 
{T^)t>o o.'nd has transition semigroup {Pt)t>o- 

Proof, (i) This is clear from the definition of the root growth and re-grafting 
dynamics. 

(ii) It is enough to check that the push-forward of the probability measure 
P^ under the map i?^ : — > is the measure P-^''(^). This, however, 
follows from the observation that the restriction of length measure on a tree 
to a subtree is just length measure on the subtree. 

(iii) This is immediate from part (ii) , the limiting construction in Subsection 
13.11 and part (iv) of Lemma 12.61 Indeed, we have that 

(3.11) supdQ^root{Xt,Xt o R^) < dn{T,R^{T)) < rj. 

(iv) By a monotone class argument, it is enough to consider the case where 
the test function g is continuous. It follows from part (iii) that Ptg{Rrj{T)) 
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converges pointwise to Ptg(T) as ], 0, and it is not difficult to show using 
Lemma El and part (i) that (t,r) ^ Ptg{Rrj{T)) is B{R+) x S(T™°*)- 
measurable. We omit the details, because we will establish an even stronger 
result in Proposition 16.11 

(v) By construction and part (ii) of Lemma 12.71 we have for t > and 
(T, 7ro,vr) G that, as a set, X°{T,TTQ,-ir) is the disjoint union T^UJO,*]. 
Put 

9t{T, 7ro,7r) 



Thus 6t maps into 0,. Note that Xs o 6^ = Xg^t and that Og o 9t = Os+t, 
that is, the family {Ot)t>o is a semigroup. It is not hard to show that 
{t, (r, 7ro,7r)) 1-^ 0t(T, 7ro,7r) is jointly measurable, and we leave this to the 
reader. 

Fix t > and (T, 7ro,vr) G fi. Write /i' for the measure on T°n]0,t] that 
restricts to length measure on T° and to Lebesgue measure on ]0,t]. Write 
/i" for the length measure on X°(T, ttq, vr). The strong Markov property will 
follow from a standard strong Markov property for Poisson processes if we 
can show that ^' = /x". This equality is clear from the construction if T is 
finite: the tree Xt{T, ttq, vr) is produced from the tree T and the set ]0, t] by 
a finite number of dissections and rearrangements. The equality for general 
T follows from the construction and part (iii) of Lemma 12.71 □ 

4. Connection with Aldous's construction of the CRT 

Let {TZt)t>o, {%)t>o, and (r„)„gN be as in the Introduction. Thus {Tt)t>Q 
has the same law as {Xt)t>Q under P"^'', where Tq is the trivial tree. 

Proposition 4.1. The two random finite rooted trees T^m- o-nd have 
the same distribution for all n £N. 

Proof. Let denote the object obtained by taking the rooted finite tree 
with edge-lengths T^m- and labeling the leaves with 1, . . . ,n, in the order 
that they are added in Aldous's construction. Let T„ be derived similarly 
from the rooted finite tree with edge-lengths T^^-, by labeling the leaves 
with 1,... ,n in the order that they appear in the root growth with re- 
grafting construction. It will suffice to show that i?„ and T„ have the same 
distribution. Note that both i?„, and are rooted, bifurcating trees with 
n labeled leaves and edge-lengths. Such a tree 5„ is uniquely specified by 
its shape, denoted shape(6'„), which is a rooted, bifurcating, leaf-labeled 



(3.12) 




{(s, x) G M++ X M++ : (t + s, t + x) G vr} 
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combinatorial tree, and by the list of its (2n — 1) edge-lengths in a canonical 
order determined by its shape, say 

lengths(5„) := (length(5„, 1), . . . , length(5„, 2n - 1)), 

where the edge- lengths are listed in order of traversal of edges by first work- 
ing along the path from the root to leaf 1, then along the path joining that 
path to leaf 2, and so on. 

Recall that t„ is the nth point of a Poisson process on with rate t dt. 
We construct Rn and r„ on the same probability space using cuts at points 
UiTi, 1 < i < n — 1, where Ui,U2, ■ ■ ■ is a sequence of independent random 
variables uniformly distributed on the interval ]0, 1] and independent of the 
sequence Then, by construction, the common collection of edge- 

lengths of Rn and of T„ is the collection of lengths of the 2n — l subintervals 
of ]0, r„] obtained by cutting this interval at the 2n — 2 points 

{Xf \ 1 < i < 2n - 2} := y {Um, n} 

i=l 

where the X^"^^ are indexed to increase in i for each fixed n. Let Xg"^ ^ 
and ^2n-i •— '''n- Then 

(4.1) length(i?„, i) = Xf'^ - x'lX 1 < « < 2n - 1, 

(4.2) length(r„, i) = length(ii„, a„,i), 1 < i < 2n - 1, 

for some almost surely unique random indices an,i G {1, ... 2n — 1} such 
that i I— > an,i is almost surely a permutation of {1, ... 2n — 1}. According to 
|Ald93| Lemma 21], the distribution of Rn may be characterized as follows: 

(i) the sequence lengths(i?„) is exchangeable, with the same distribution 
as the sequence of lengths of subintervals obtained by cutting ]0, r„] 
at 2n — 2 uniformly chosen points {UiTn '■ 1 < i <2n — 2}; 

(ii) shape(i?ri) is uniformly distributed on the set of all 1 x 3 x 5 x • • • x 
(2n — 3) possible shapes; 

(iii) lengths(i?„) and shape(i?rt) are independent. 

In view of this characterization and (|4.2j) . to show that T„ has the same 
distribution as i?„ it is enough to show that 

(a) the random permutation {i i-^ an,i ■ l<i<2n — l}isa function of 
shape(T„); 

(b) shape(Tn) = ^'„(shape(i?„)) for some bijective map from the set 
of all possible shapes to itself. 

This is trivial for n = 1, so we assume below that n > 2. Before proving (a) 
and (b), we recall that (ii) above involves a natural bijection 



(4.3) 



(/i, . . . , In-i) ^ shape(i?„) 
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where € {1, . . . , 2n — 3} is the unique i such that G 

^^(n^i)^^(n-i)^^ Hence In~i is the index in the canonical ordering of edges 
of Rn-i of the edge that is cut in the transformation from Rn-i to Rn by at- 
tachment of an additional edge, of length t„ — r„_i, connecting the cut-point 
to leaf n. Thus (ii) and (iii) above correspond via ()4.3|) to the facts that 
Ii, . . . , In-i are independent and uniformly distributed over their ranges, 
and independent of lengths(ii„). These facts can be checked directly from 
the construction of {Rn)nm from (T„)„gN and (?7n)neN using standard facts 
about uniform order statistics. 

Now (a) and (b) follow from (|4.3)) and another bijection 

(4.4) ^shape(r„) 

where each possible value i of Im is identified with edge am,i in the canon- 
ical ordering of edges of T^. This is the edge of whose length equals 
length.{Rm,i)- The bijection (|4.4|) . and the fact that an,i depends only on 
shape(T„), will now be established by induction on n > 2. For n = 2 the 
claim is obvious. Suppose for some n > 3 that the correspondence between 
(/i, . . . ,/ra-2) and shape(T„_i) has been established, and that the length of 
edge cTn-i,j in the canonical ordering of edges of T„_i is equals the length of 
the ith edge in the canonical ordering of edges of Rn-i, for some <Tn-i,i which 
is a function of i and shape(T„_i). According to the construction of if 
In-i = i then is derived from T„_i by splitting Tn-i into two branches 
at some point along edge Cn-i^i in the canonical ordering of the edges of 
Tn-i, and forming a new tree from the two branches and an extra segment 
of length Tn — Tn-i- Clearly, shape(T„) is determined by shape(T„_i) and 
In-i, and in the canonical ordering of the edge-lengths of T„ the length of 
the ith edge equals the length of the edge an,i of Rn, for some an,i which is 
a function of shape(T„_i) and In-i, and hence a function of shape(T„). To 
complete the proof, it is enough by the inductive hypothesis to show that 
the map 

(shape(r„,_i),/„_i) shape(r„) 

just described is invertible. But shape(T„_i) and can be recovered 
from shape (T„) by the following sequence of moves: 

• delete the edge attached to the root of shape(r„) 

• split the remaining tree into its two branches leading away from the 
internal node to which the deleted edge was attached; 

• re-attach the bottom end of the branch not containing leaf n to leaf 
n on the other branch, joining the two incident edges to form a single 
edge; 

• the resulting shape is shape(T„_i), and is the index such that 
the joined edge in shape(r„_i) is the edge crn-i,i„_i in the canonical 
ordering of edges on shape(r„_i). 

□ 
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5. Recurrence and convergence to stationarity 

Lemma 5.1. For any {T,d,p) G T''""* we can build on the same probability 
space two T^°°^ -valued processes X' and X" such that: 

• X' has the law of X under P-^" , where Tq is the trivial tree consisting 
of just the root, 

• X" has the law of X under P-^, 

• for all t > 0, 

(5.1) dQ^root{X[, X[') < dQjjroot (To , T) = sup{d{p,x) : x G T} 

(5.2) lim dr^TTroot (X^, X^') = 0, almost surely. 

t— >oo 

Proof. The proof follows almost immediately from construction of X in Sec- 
tion|31and Lemma [,'121 The only point requiring some comment is (|5.2|) . For 
that it will be enough to show for any e > that for P-^-a.e. (T, ttq, vr) G ft 
there exists t > such that the projection of ttq H (]0, t] x T°) onto T is an 
e-net for T. 

Note that the projection of ttq (1 (\0,t] x T") onto T is a Poisson process 
under P^ with intensity tfi, where /x is the length measure on T. Moreover, 
T can be covered by a finite collection of e-balls, each with positive fi- 
measure. Therefore, the P-^-probability of the set of (T, 7ro,7r) G Q such 
that the projection of ttq D (]0, t] x T°) onto T is an e-net for T increases as 
t ^ oo to 1. □ 

Proposition 5.2. For any T G i"""""*, the law of Xt under converges 
weakly to that of the Brownian CRT as t ^ oo. 

Proof. It suffices by Lemma l5.1l to consider the case where T is the trivial 
tree. 

We saw in the Proposition 14.11 that, in the notation of the Introduction, 
7^,j_ has the same distribution as TZr„-. Moreover, we recalled in the In- 
troduction that TZt converges in distribution to the continuum random tree 
as t ^ CX3 if we use Aldous's metric on trees that comes from thinking 
of them as closed subsets of with the root at the origin and equipped 
with the Hausdorff distance. By construction, ('7^)t>o has the root growth 
with re-grafting dynamics started at the trivial tree. Clearly, the rooted 
Gromov-Hausdorff distance between % and 7^„^i_ is at most r„+i — r„ for 
Tn l£ t < Tn+i- It remains to observe that r^+i — — > in probability as 
n — > oo. □ 

Proposition 5.3. Consider a non-empty open set U C x™°*. For each 

rp g rproot 

(5.3) P'^{/or all s > 0, there exists t > s such that Xt G U} = 1. 



REAL TREES 



29 



Proof. It is straightforward, but notationally rather tedious, to show that if 
B' C T™°* is any ball and Tq is the trivial tree, then 

(5.4) P^^'iXt £B'}>0 

for all t sufficiently large. Thus, for any ball B' C T''°°* there is, by 
Lemma l5. 11 a ball B" C T™°* containing the trivial tree such that 

(5.5) inf P^{Xt eB'}>0 

T£B" 

for each t sufficiently large. 

By a standard application of the Markov property, it therefore suffices to 
show for each T £ X''°°* and each ball B" around the trivial tree that 

(5.6) P^{there exists t > such that Xt G B"} = 1. 

By another standard application of the Markov property, equation 1)5. 6() 
will follow if we can show that there is a constant p > depending on B" 
such that for any T G T''""* 

liminfP^{Xt G B"} > p. 

This, however, follows from Proposition 15. 21 and the observation that for any 
e > the law of the Brownian CRT assigns positive mass to the set of trees 
with height less than e (which is just the observation that the law of the 
Brownian excursion assigns positive mass to the set of excursion paths with 
maximum less that e/2). □ 

Proposition 5.4. The law of the Brownian CRT is the unique stationary 
distribution for X . That is, if^ is the law of the CRT, then f ^,{dT)Ptf{T) = 
f C{dT)f{T) for allt>0 and f G b^(T'^°°*), and ^ is the unique probability 
measure on T''""* with this property. 

Proof. This is a standard argument given Proposition 15.21 and the Feller 
property for the semigroup {Pt)t>o established in Proposition 16. H but we 
include the details for completeness. 

Consider a test function / : T™°* — > M that is continuous and bounded. 
By Proposition 16.11 below, the function Ptf is also continuous and bounded 
for each t >0. Therefore, by Proposition 15. 2| 



(5.7) 



/ ?(dT)/(r) = lim / adT)PsfiT) = lim / i{dT)Ps+tf iT) 
J ^^^^ J J 

= lim / adT)PsiPtf){T) = [ i{dT)Ptf{T) 



for each t > 0, and hence ^ is stationary. Moreover, if C is a stationary 
measure, then 



(5.8) 



C(dr)/(r) = J cidT)Ptf{T) 

^ J CidT) (^j adT)f{T)^ = J adT)f{T), 
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and C = as claimed. □ 

6. Feller property 

The following result says that the law of Xt under P'^ is weakly continuous 
in the initial value T for each t >0. This property is sometimes referred to 
as the Feller property of the semigroup (-Pt)t>o, although this terminology 
is often restricted to the case of a locally compact state space and transition 
operators that map the space of continuous functions that vanish at infinity 
into itself. A standard consequence of this result is that the law of the 
process {Xt)t>Q is weakly continuous in the initial value (when the space of 
cadlag T''°°*-valued paths is equipped with the Skorohod topology). 

Proposition 6.1. // the function f : T''°°*' — > M is continuous and bounded, 
then the function Pff is also continuous and bounded for each t >0. 

We will prove the proposition by a coupling argument that, inter alia, 
builds processes with the law of X under P"^ for two different finite values 
of T on the same probability space. The key to constructing such a coupling 
is the following pair of lemmas. 

We require the following notion. A rooted combinatorial tree is just a con- 
nected, acyclic graph with one vertex designated as the root. Equivalently, 
we can think of a rooted combinatorial tree as a finite rooted tree in which 
all edges have length one. Thus any finite rooted tree is associated with a 
unique rooted combinatorial tree by changing all the edge lengths to one, 
and any two finite rooted trees with the same topology are associated with 
the same rooted combinatorial tree. If U and V are two rooted combinato- 
rial trees with leaves labeled by {xi, . . . , Xn) and (yi, . . . , then we say 
that U and V are isomorphic if there exists a graph isomorphism between U 
and V that maps the root of U to the root of V and Xi to yi for 1 < i < n. 

Lemma 6.2. Let {T,p) be a finite rooted trees with leaves {xi, . . . ,Xn} = 
T\T°. (recall the definition of the skeleton T° from \2.1b}) ). Write r/ for the 
minimum of the (strictly positive) edge lengths inT. Suppose that {T',p') is 
another finite rooted tree with c^q jjroot ((T', p'), (T, /))) < 5 < jg. Then there 
exists a subtree {T",p') (T',p') and a map f : T ^ T" such that: 

(i) fip) =P', 

(ii) T" is spanned by {/(xi), . . . , fixn),p'}, 

(iii) duiT',T") <35, 

(iv) dis(/) <86, _ 

(v) T" has leaves {f{xi), . . . , /(x„)}, 

(vi) by possibly deleting some internal edges from the rooted combinato- 
rial tree associated to T" with leaves labeled by (/(xi), . . . , /(x„)), 
one can obtain a leaf-labeled rooted combinatorial tree that is iso- 
morphic to the rooted combinatorial rooted tree associated to T with 
leaves labeled by (xi, . . . , x„). 
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Proof. We have from (|2.1()|) that there is a correspondence containing 
{p,p') between T and T' such that dis(3f?™°*) < 26. For x G T \ {p}, choose 
fix) G r such that (x, fix)) E and put /(p) := p' . Set T" to be the 

subtree of T' spanned by {/(xi), . . . , fixn),p'}- For x G T define /(x) G T" 
to be the point in T" that has minimum distance to fix). In particular, 
fip) = fip) = p' and /(xj) = /(xj) for all i, so that (i) and (ii) hold. 

For x' G T' \ {p'} choose ^(x') such that igix'),x') G and put 

gip') := p. Then /(T) and giT') are 25-nets for T' and T, respectively, 
and dis(/) V dis((7) < 25. For each i G {!,..., n} and y' G T' we have 
ip,p'),ixi,fix,)),igiy'),y') G 5?™°*. Hence dT'ip',y') < drip, giy')) + 26, 
and dx' iy' , f ixi)) < drigiy'), Xi) + 2J. Now fix y' G T', and choose i G 
{1, . . . ,n} such that giy') G [/?, Xj]. Then 

dr'ip'Jixi)) + 2dH ({y'}, [p,/(xi)]) = dx'ip'^y) + dx'iy Jixi)) 
(6.1) < (iT(p,Xi) +45 

and hence dY{i{y'},T") < 36. Thus (iii) holds. 

For x,y & T, 
(6.2) 

\dTix,y)-dT>,ifix)Jiy))\ 

< \dTix,y)-dT'ifix),fiy))\ + dT'ifix),fix))+dT'ifiy),fiy)) 
<dis(/) + 2dH(T',r) <85, 
and (iv) holds. 

In order to establish (v), it suffices to observe for 1 < i 7^ j < n that, by 
part (iv), 

dT"ifiXi),fiXj)) +dT"ifixj),p) - dT"ifixi),p') 

> dTixi,Xj) + dTixj,p) - drixup) - 3dis(/) 
>2r]-2A6 

> 0. 

Similarly, part (vi) follows from part (iv) and the observations in Sub- 
section 12.21 about re-constructing tree shapes from distances between the 
points in subsets of size four drawn from the leaves and the root of T" once 
we observe the inequality ^4dis(/) < 166 < rj. □ 

Lemma 6.3. Let (T, p) be a finite rooted tree and e > 0. There exists 
6 > depending on T and e such that if iT',p') is a finite rooted tree 
with dQ^TootiiT' , p'), (T, p)) < 6, then there exist subtrees iS,p) T and 
iS',p') T' for which: 

(i) duiS,T) < e and duiS',T') < e, 

(ii) S and S' have the same total length, 

(iii) there is a bijective measurable map ^ : S ^ S' that preserves length 
measure and has distortion at most e, 
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(iv) the length measure of the set of points a G 5 such that {b' G S' : 
ip{a) < b'} 7^ ^/'({fe (z S : a < b}) (that is, the set of points a such 
that the subtree above ip{a) is not the image under ip of the subtree 
above a) is less than e. 

Proof. As in Lemma l6.21 denote by r/ the minimum of the (strictly positive) 
edge lengths of T. Let {T',p') be a finite rooted tree with 

(6.4) dG^roo.{{T',p'),{T,p))<6<^, 

where 6 depending on T and e will be chosen later. Set {T" , p') and / to be 
a subtree of T' and a function from T to T" whose existence is guaranteed 
by Lemma 16.21 for this choice of 6. Let {xi, . . . x„} denote the leaves of T 
and write x[ := f{xi) = f{xi) for i = 1, . . . , n. 

Define inductively subtrees Si, . . . , Sn oiT (all with root p) and S[, . . . ,S!^ 
of T" C T' (all with root p') as follows. Set Si := [p,yi] and S[ := [p,y[], 
where yi and y[ are the unique points on the arcs [p, xi] and [p' , x[], respec- 
tively, such that 

(6.5) dT{p,yi) = dT'{p,y'i) = dT{p,xi) A dT'{p,x[). 

Suppose that Si, ... , Sm and 5^, . . . , 5^ have been defined. Let z^+i and 
z'm+i be the points on Sm and S'^ closest to x^+i and x'^j^^. Put Sm+i '■= 
Sm^Zm+i,ym+i] and S'^^-^ := S'^U]z'^^^,y'^^^], where ym+i and y'^_^^ are 
the unique points on the arcs ]zm+i, Xm+i] and respectively, 
such that 

d-T' {Zm+l , ym+l ) = dr' {z'm+l , v'm+l ) 

= dT{Zra+l,Xm+l) ^ dx' {z'^+i, x'^j^^) . 

Set 5 := Sn and S' := S'^. 

Put zi := p, and z[ := p'. By construction, the arcs ]zk,yk], I < k < n, 
are disjoint and their union is S'yjp}. Similarly, the arcs Jz^, are disjoint 
and their union is S' \ {p'}. Moreover, the arcs ]zk,yk] and ]2:fc,yfc] have 
the same length (in particular, S and S' have the same length and part (ii) 
holds). We may therefore define a measure-preserving bijection ip between S 
and S' by setting il^{p) = p' and letting the restriction of to each arc J^fc; ^fc] 
be the obvious length preserving bijection onto ]2;^,yy. More precisely, if 
a ^Zk,yk\, then ^(a) is the uniquely determined point on l-z^,?/^ such that 
^^5'(4'V'(«)) = ds{zk,a). 

We next estimate the distortion of tp to establish part (iii). We first claim 
that for a,b G S, 

(6.7) \ds{a,b)-ds'{i^ia),ijm<5l, 
where 

(6.8) 7 := max |d5(yfc, ym) -d5'(yfc> y^)! V max \ds{yk, p) -ds'{y'k, p)\- 

l<K,m<n l<fe<n 

To see (|6.7|) . consider a,b £ S\{p} with a €.]zk,yk] and b £]zm,ym] where 
k ^ m. (The case where a = p or b = p holds "by continuity" and is left 
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to the reader.) Without loss of generahty, assume that k < m, so that 
Vk ^Vm < Zm < b < Hm the partial order on S and y'f^Ay'^<z'^< ■0(6) < 
y'j^ in the partial order on S' . Note that yk A ym and Zk are comparable in 
the partial order, as are A y'^ and z^. Moreover, by part (vi) of Lemma 
16.21 yk /\ym < Zk if and only if y'^ /\ y'm ^'k- then have to consider four 
cases depending on the relative positions oi yk A ym, a and A y^, ip{a). 

Case I: yk ^ym < a <yk and y'kAy'm< V'(a) < y'k- 
We have 

(6.9) ds{yk, ym) = ds{yk, a) + ds{a, b) + ds{b, ym) 
and 

(6.10) ds'{yly'm)=ds'{y'kMa))+ds'{^l^{a)Mb))+ds'{m,y'm)- 
By construction, 

(6.11) dsiyk,a) = ds'iy'k,^P{a)) 
and 

(6.12) ds{b,ym) = ds'{tp{b),ym). 
Hence 

(6.13) \ds{a,b) - ds'{'il^{a),ij{b))\ = \ds{yk,ym) - ds'{y'k,y'm)\ < 7- 

Case II: yk f\yra < a<yk and 0(a) <y'^Ny'm< v'k- 
Note that in this case Zk<,yk l\ ym ■ We again have 

(6.14) ds{yk-, ym) = dsiyk, a) + dsia, b) + ds{b, ym), 
but now 

ds'{yk,y'm) = ds'{y'k,'^{o)) + ds'{ip{a),il:{b)) + ds'{ip{b),ym) 
-2ds'{iP{a),y',,Ay'J. 

Let ye be such that Zk = ye /\yk = Vi A ym and hence z',^ = y'^ Ay',^ = y'^ Ay'm- 
Observe from Subsection 12 . 21 that 

(6.16) 

ds'{i^{a),y'k Ay'm) 

= ^{ds'{y'e,y'm) + ds'iy'k, p) - ds'{y'k,y'm) -ds'{y'i,p)) - ds'{zk,ip{a)) 

< ^{ds{ye,ym) + dsiyk, p) - dsiyk, ym) - dsiye,p)) + - dsizk,a) 
= dsizk,yk A ym) - dsizk,a) + 27 
<27, 
and hence 

(6.17) \dsia,b)-ds'ii^ia),m)\<^l- 
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Case III: a < yk /\ym < Vk and A < < y^- 

Note that in this case, z'^<y'^f\ y'^. This case is similar to Case II, but we 
record some of the details for use later in the proof of part (iv) . Letting the 
index £ be as in Case II, we have 

ds{yk,ym) = ds{yk,a) +ds{a,b) + ds{b,ym) 

(6.18) 

- ^as[a,yk A ym) 

and 

(6.19) ds'{y'k,y'm) = ds'{y'k,Ha)) + ds'{il'{a),ij{b)) + ds'{'il^{b),y'^). 
We have 

ds{a,yk A ym) 

= ^{ds{ye,ym) + ds{yk,p) - ds{yk,ym) - ds{ye,p)) - ds{zk,a) 

(6.20) ^ ^{ds'{y'e,y'm) + ds'{y'k,p') - ds'{y'k,y'm) - ds'{y'i,p')) + 

- ds'{z'k,i^{a)) 
= ds'{zk,yk Ay'm) - ds'{z'k,tp{a)) + 2j 
<27, 

and hence 

(6.21) \ds{a,b)-ds'ma),^bib))\<5-f. 

Case IV: a<ykA.ym<yk and V(a) < y'k A y'm < y'k- 
Letting the index i be as in Case II, we have 

(6.22) ds{zk,ym) = ds{zk,a) +ds{a,b) + ds{b,ym) 
and 

(6.23) ds'iz'i^y'J = ds'izlijia)) + ds'{iJ{a),^l^ib)) + ds'mb),y'J. 
Hence, from Subsection 12.21 

ds{a,b) - ds'{i^{a),Tp{b)) 
= ds{zk,ym) - ds'{z[,y'^) 

= dsiye /\ym,ym) - ds'{y'eAy'm,y'm) 

(6.24) ^ 

= -^ids{ym,p) +ds{ye,ym) - ds{yi,p)) 
- ^ids'{ym,p) +ds'{y'g,y'm) - ds'{y'e,p))- 

Thus 

(6.25) \ds{a,b)-ds'ma),ijm<ll- 
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Combining Cases I-IV, we see that (|(i.7|) holds. We thus require an es- 
timate of 7 to complete the estimation of the distortion of ip . Clearly, 

\ds{yk,ym) - ds'{y'k,y'm)\ < \dT{xk,Xjn) - dT"{x'k,x'^)\ 

(6.26) + dTiyk,Xk) + dT{ym,Xm) 

+ dT"iy'k^x'k)+dT"iy'm,x'^). 

By 

dT{yi,xi)\/ dT"{y'i,x[) = \dT{p,xi) -dT"{p',x[)\ 

(6.27) < dis(/) 

< 8<5. 

For 2 < k < n there exists by construction an index z£{l,2,...,/c — 1} such 
that Zk € [zi,yi] and G Wi^Vj}- Applying the observations of Subsection 

driVk^Xk) V dT"{y'k,x[) 
= \dT{yk,xk) - dT"{yk,x'k)\ 

= \dTiZk,Xk) - dT"{Zk,x'k)\ 

(6.28) < ^{\dT{xi,Xk) - dT"{xlx'i,)\ 

+ \dT{p,Xi) - dT"{p',x'i)\ + \dTip,Xk) -dT"(P:4)l} 

< ^dis(/) 

< 126. 
Thus, from ()6.26() . 

(6.29) \ds{yk,ym)-ds>{y'k,y'J\ < (8 + 4 x 12)5 = 565. 

A similar argument shows that \ds{yk, p) — ds'iy'k, p')\ ^ (8 + 2x12)5 = 32(5, 
and hence 7 < 565. Substituting into 1)6. 7() gives 

(6.30) dis(V') < 57 < (5 X 56)5 = 2805. 
Moving to part (i), apply ()6.28() to obtain 

(6.31) dii{S,T) < max d{yk,Xk) < 7 < 565 

l<i<n 

and, by similar arguments, 

(6.32) dniS', T') < dniS', T") + dn{T" ,T') < 595. 

Finally, we consider part (iv) . Suppose that a E 5 is such that the subtree 
of S' above ip{a) is not the image under of the subtree of S above a. Let 
k be the unique index such that a £]zk,yk] (and hence V'(o) G]^^,?/^). It 
follows from the construction of V' that there must exist an index £ such that 
either Zk < a < zi and z'f^ < z'^ < ip{a) or Zk < zg < a and < tp{a) < z\. 
These two situations have already been considered in Case III and Case II 
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above (in that order): there we represented zi as A and as y'^l\y'^. 
It fohows from the inequaUty (|6.2fl)) that the mass of the set of points o 
that satisfy the first alternative is at most 2771 < \Vlbn. Similarly, from 
the inequality (|6.16)) and the fact that ■0 is measure-preserving, the mass 
of the set of points a that satisfy the second alternative is also at most 
2771, < 1126n. Thus the total mass of the set of points of interest is at most 
224:6n. □ 

Before completing the proof of Proposition 16.11 we recall the definition 
of the Wasserstein metric. Suppose that {E, d) is a complete, separable 
metric space. Write B for the set of continuous functions functions / : 
E such that \f{x)\ < 1 and \ f{x) - f{y)\ < d{x,y) for x,y e E. The 
Wasserstein (sometimes transliterated as Vasershtein) distance between two 
Borel probability measures a and /? on is given by 

(6.33) d^^{a,P) := sup / fda— / fdf3 

f(^B J J 

The Wasserstein distance is a genuine metric on the space of Borel proba- 
bility measures and convergence with respect to this distance implies weak 
convergence (see, for example, Theorem 3.3.1 and Problem 3.11.2 of |EK86j ^. 
If V and W are two E'- valued random variables on the same probability space 
(S, A, P) with distributions a and f3, respectively, then 

dw(a,/5) <sup|P[/(y)]-P[/(W^)]| 
^^■^^^ < supP[|/(y) - f{W)\] < F[d{V,W)]. 

Proof of Pror)osition[6l\ For (T, p) G T''""'^ and t > 0, let 

(6.35) Pi((T,p),.):=P(^'^){^te-}. 

We need to show that (T,p) ^ P(((T, /)),•) is weakly continuous for each 
t > 0. This is equivalent to showing for each (T, p) G T''°°* and t > that 

(6.36) hm dw(Pi((T,p),-),Pt((r',p'),-)) =0. 

{T' ,p')^{T,p) 

^From the coupling argument in the proof of part (iii) of Theorem IHl (in 
particular, the inequality (|3.11|) ). we have that 

dy,{Pt{{T,p)r),-Pt{{T',p'),-)) 

<dy,{Pt{{T,p)r),-Pt{{Rv{T),p),-)) 

(6.37) + dw{Pti{RrjiT),p), •),Pt{iR^{T'),p'), .)) 

+ dw{PtmniT'),p)r),Pt{{T',p'),-)) 
<dw{Pt{iRr,{T),p),-),Pti{Rr,{T'),p'),-)) + 2r]. 

By part (ii) of Lemma l2.61 i?^(T') converges to Rri{T) as (T',p') converges 
to {T,p), and so it suffices to establish (|6.36|) when (T, p) and {T',p') are 
finite trees, and so we will suppose this for the rest of the proof. 
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Fix (T, p) and e > 0. Suppose that 5 > depending on (T, p) and e is suf- 
ficiently small that the conclusions of Lemma [6 . 31 hold for any (T', p') within 
distance 5 of {T,p). Let {S,p) and {S',p') be the subtrees guaranteed by 
Lemma 16.31 From the coupling argument in proof of part (iii) of Theorem |31 
we have 

(6.38) dw(Pt((r,/9),-),Pt((5,p),-)) <e 
and 

(6.39) dw{'Pti{T',p'),-),Pti{S',p'),-)) <e. 

It therefore suffices to give a bound on d-w(Pti{S, p), •),Pt{{S' , p), •)) that 
only depends on e and converges to zero as e converges to 0. 

Construct on some probability space (S,^, P) a Poisson point process Hq 
on the set M"'""'' x 5° with intensity p, where p is the length measure on 
S. Construct on the same space another independent Poisson point process 
on the set {{t,x) G x 1^++ : x < t} with intensity A (8) A restricted to 
this set. If we set H'q := {{t,^{x)) : (t,x) G Hq} C M++ x (S")°, then U'q is 
a Poisson process on the set M'^^ x {S')" with intensity A (8> p' , where p' is 
the length measure on S' (because ■0 preserves length measure). Now apply 
the construction of Subsection 13. II to realizations of IIq and 11 (respectively, 
IIq and 11) to get two T™°*-valued processes that we will denote by {Yt)t>o 
and (Yl)t>o. We see from the proof of Theorem 01 that Y (respectively, Y') 
has the same law as X under p(^<P^ (respectively, P^'^ 'f 

Define a map il^t from Yt = SU]0,t] to = 5'U]0,t] by setting the 
restriction of ipt to S he ip and the restriction of tpt to ]0,t] be the identity 
map. Let dt and d[ be the metrics on Yt and y/, respectively. We will bound 
the rooted Gromov-Hausdorff distance between Yt and Y^ by bounding the 
distortion of ■0*- 

The cut-times for Y and Y' coincide. If ^ is a cut-point of Y at some 
cut-time r, then the corresponding cut-point for Y' will be tp{£,). 

It is clear that the distortion of ipt is constant between cut-times. Write 
Bt for the set of points b £ Yt such that the subtree of y/ above tpt{b) is not 
the image under tpt of the subtree of Yt above b. The set Bt is unchanged 
between cut-times. 

Consider a cut-time r such that the corresponding cut-point ^ is in Yr- \ 
Br-- If X and y are in the subtree above in Y^-, then they are moved 
together by the re-grafting operation and their distance apart is unchanged 
in Yt-. Also, tpr-ix) and ipr-{y) are in subtree above ipr-{(,) in Y^_ and 
these two points are also moved together. More precisely, 

(6.40) drix, y) = dr- {x, y) 
and 



(6.41) 



d'^{tpr{x),tl}r{y)) = d'^_{lpr-{x),'>pT-{y))- 
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The same conclusion holds if neither x ox y are in the subtree above ^ in 
Yr-. If X is in the subtree above ^ in Yr- and y is not, then 

(6.42) dr{x,y) = dr-{x,^) + dr-{T,y) 
and 

(6.43) d'Mr{x),My)) = <_(^r-(x),Vr-(0) +<-(^,^r-(y)) 

(where we recall that r is the root in each of the trees Yt—,Yt-,Y^_,Y^). 
Combining these cases, we see that 

(6.44) dis(^^) < 2dis(V'r-)- 

Moreover, if ^ G Y^- \ Br-, then B^- = B^-. 
Also, for any t > we always have the upper bound 

dis(V'f ) < diam(yi) + diam(y/) 

< diam(5) + diam(5') + 2t 

< diam(r) + diam(r') + 2t 

6.45 

< 2diam(r) + dcH-"* ({T, p), {T',p)) + 2t 

< 2diam(T) + 6 + 2t 
=:Dt 

Set Nt := |no n {]0,t] X S")! + |n n {{s,x) : O < x < s < t}| and write It 
for the indicator of the event {Ho H (]0, t] x Bq) ^ 0}, which, by the above 
argument, is the event that ^ € Bt- for some (cut-time, cut-point) pairs 
(r, ^) with < r < t. We have 

dw{Pt{{S,p),-),Pt{{S',p),-) 

<F[dG^root{Yt,Yl)] 



(6.46) 



< ^P[dis(V't)] 

< Ip [£2^*+/,D,] 

= ^ jeexp (^p{T)t +^-^+[1- exp {-et)] Dt 
and this suffices to complete the proof. □ 
7. ASYMPTOTICS OF THE AlDOUS-BRODER ALGORITHM 

Given an irreducible Markov matrix P with state space V, there is a 
natural probability measure on the collection of combinatorial trees with 
vertices labeled by V that assigns mass 

(7.1) C-'l[nx,y) 

to the tree T, where C is a normalization constant and the product is over 
pairs of adjacent vertices (x, y) in T ordered so that y is on the path from 
the root to x. For example, if P(x, y) = 1/|F| for all x, y G F, (so that 
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the associated Markov chain consists of successive uniform random picks 
from V), then the distribution H7.1() is uniform on the set of rooted 
combinatorial trees labeled by V. 

The Aldous-Broder algorithm |AT89[ IBro89l lAldOOj is a tree- valued 
Markov chain that has the distribution in 1)7. ip as its stationary distribu- 
tion. The discrete time version of the algorithm has the following transition 
dynamics. 

• Pick a vertex v at random according to P(p, •), where p is the current 
root. 

• If f = p, do nothing. 

• li V p: 

— Erase the edge connecting v to the unique vertex adjacent to v 
and on the path from p to v. 

— Insert a new edge between v and p. 

— Designate v as the new root. 




Figure 4 illustrates Aldous-Broder algorithm in discrete time. Here 
the dots mark the vertices that become the root in the next step. Once 
the new root is chosen we erase the edge adjacent to the new root on the 
path from the old to the new root and insert an edge connecting the old to 
the new root. The bold edges in the picture present the edges introduced 
recently. 

It will be more convenient for us to work with the continuous time version 
of this algorithm in which the above transitions are made at the arrival 
times of an independent Poisson process with rate — 1) (so that the 

continuous time chain makes actual jumps at rate 1). 

We can associate a rooted compact real tree with a rooted labeled com- 
binatorial tree in the obvious way by thinking of the edges as line segments 
with length 1. Because we don't record the labeling, the process that arises 
from mapping the continuous-time Aldous-Broder algorithm in this way 
won't be Markovian in general. However, this process will be Markovian in 
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the case where P is the transition matrix for i.i.d. uniform samphng (that 
is, when ¥{x,y) = 1/\V\ for all x,y (zV) and we assume this from now on. 
The following result says that if we rescale "space" and time appropriately, 
then this process converges to the root growth with re-grafting process. If 
T = (T, d, p) is a rooted compact real tree and c > 0, we write cT for the 
tree (T,cd,p) (that is, cT = T as sets and the roots are the same, but the 
metric is re-scaled by c). 

Proposition 7.1. LetY^ = {Y^)t>o he a sequence of Markov processes that 
take values in the space of rooted compact real trees with integer edge lengths 
and evolve according to the dynamics associated with the continuous-time 
Aldous-Broder chain for i.i.d. uniform sampling. Suppose that each tree Yq 
is non-random with total branch length Nn, that Nn converges to infinity 

— 1/2 

as n —>■ oo, and that Nn ' Yf^ converges in the rooted Gromov-Hausdorff 
metric to some rooted compact real tree T as n —>■ oo. Then, in the sense of 
weak convergence of processes on the space of cddldg paths equipped with the 
Skorohod topology, (iV„, ' ^"(A^n' i))t>o converges as n ^ oo to the root 
growth with re-grafting process X under . 

Proof Define Z" = {Z]^)t>o by 

(7.2) := iV-l/2yn(^l/2^). 

For ?? > 0, let Z'''" be the T^^^-valued process constructed as follows. 

. Set Zo^'" = i2^„(Zo"), where ry„ := N-^'^[N^'\\. 

• The value of Z^''^ is unchanged between jump times of {Z")t>o. 

• At a jump time r for (Z")t>o, the tree Zr'"" is the subtree of Z" 
spanned by Z^'" and the root of Z". 

An argument similar to that in the proof of Lemma \'A.2\ shows that 

(7.3) SUpdH(^r,^r) <^n, 

t>0 

and so it suffices to show that Z'''" converges weakly as n — > oo to X under 

pi?„(T)_ 

Note that Zg''" converges to Rrj{T) as n ^ oo. Moreover, if A is the map 
that sends a tree to its total length (that is, the total mass of its length 
measure), then lim„_+oo ^(-^o") = ^ ° RrjiT) < oo by Lemma ESI below. 

The pure jump process Z^'"' is clearly Markovian. If it is in a state (T', p'), 
then it jumps with the following rates. 

• With rate Nn^'^{Nn^'^A{T'))/Nn = A{T'), one of the Nn^'^A{T') 

points in T' that are at distance a positive integer multiple of A^^^ 
from the root p' is chosen uniformly at random and the subtree above 
this point is joined to p' by an edge of length Nn . The chosen 

— 1/2 

point becomes the new root and an arc of length Nn that previ- 
ously led from the new root toward p' is erased. Such a transition 
results in a tree with the same total length as T' . 
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• With rate Nn' - A{T'), a new root not present in T' is attached 

I/O 

to p' by an edge of length Nn ■ This results in a tree with total 
length A(r') + 7V~^/^ 

It is clear that these dynamics converge to those of the root growth with 
re- grafting process, with the first class of transitions leading to re- graftings 
in the limit and the second class leading to root growth. □ 

Lemma 7.2. Let (T, d, p) € T™°* and suppose that {xq, • • • , Xn} C T spans 
T, so that the root p and the leaves ofT form a subset of {xq, . . . ,Xn}- Then 
the total length of T (that is, the total mass of its length measure) is given 
by 

n ^ 

d(xo,xi) + l\ -{d{xk,Xi) + d{xk,Xj)-d{xi,Xj)). 

k=20<i<j<k-l 

Proof. This follows from the observation that the distance from the point 
Xk to the arc [xj,Xj] is 

^ {d{xk,Xi) + d{xk,Xj) - d{xi,Xj)) , 

and so length of the arc connecting x^, 2 < k < n, to the subtree spanned 
by xo,...,Xfc„i is 

/\ ^ {d{xk,Xi) + d{xk,Xj) - d{xi,Xj)) . 

0<i<j<k-l 

□ 

Lemma 7.3. Let A : T™"* ^ R U {oo} be the map that sends a tree to its 
total length. For r] > 0, the map A o i?^ is continuous. 

Proof. For all > we have by Lemma ITBl that : i?^ = i?^/2 °I^r)/2i the map 
Rrj is continuous, and the range of Rr^ consists of finite trees. It therefore 
suffices to show for all > that if (T, d, p) is a fixed finite tree and (T', d' , p') 
is any another finite tree sufficiently close to T, then A o R^(T') is close to 
Aoi?^(T). 

Suppose, therefore, that (T, d, p) is a fixed finite tree with leaves 
{xi, . . . , Xn} and that (T', d' , p') is another finite tree with 

d^^.oot{{T,d,p),{T' ,d\p')) < 5, 

where 6 is small enough that the conclusions of Lemma 16.21 hold. Consider 
a rooted subtree {T",d',p') of {T',d',p') and a map f : T —> T" with the 
properties guaranteed by Lemma Set x'^ = f{xk) for 1 < /c < n. 

Fix K > 0. For 1 < A: < n, write x^ £ T for the point on the arc [/), x^] 
that is at distance k A d{p, Xfc) from x^. Set xq := p. Define Xg, ■ • ■ , x'^ € 
T" similarly. Note that Rk(T) is spanned by {xq, . . . , Xn} and R^iT") is 
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spanned by {x'q, . . . , x^}. By Lemma E21 

n ^ 

A o ^^(r) = d{xo, xi) + /\ -{d{xk,Xi) + d{xk,Xj)-d{xi,Xj)). 



k=20<i<j<k-l 



and 



AoR^iT") = d'{x'„x[)+Y^ /\ ^ {d'{x'„x'^ + d'{x',,x]) - d'ixlx'j)) . 

k=20<i<j<k-l 

Also observe that 

d{xi,Xj) = {d{xo,Xi) - k)+ + {d{xo,Xj) - k)+ 



{d{xo,Xi) - k)+ a {d{xo,Xj) - k)+ 
A |^(d(xo,Xj) + d{xo,Xj) - d{xi,Xj))^ 



and 



d'{xi,x^ = {d'{xQ,x'i) - k)+ + {d'{xQ,x',) - k)+ 



(d (xq, Xi) - k)+ a {d (x'o, xj) - k)+ 
A [^{d'{x'„x'i)+d'{x'„x'j) - d'{x'„x'^))] 



Now the function 1 1— > (i — is Lipschitz with Lipschitz constant 1 for all 
K > 0, and it follows that there is a family of Lipschitz functions F^, k > 0, 
with Lipschitz constants uniformly bounded by some constant C such that 



Aoi?4T) = F«((d(x„x,))o<,^^.<„) 



and 

Aoii.(n = F.((ci'(x'„x;.))„<^_^.<„^. 
By construction \d{xi,Xj) — d'{x'-,x'j)\ < 86, and so 

\AoR,{T)-AoR,{T")\<86C 

for all K > 0. 

Because d}i{T',T") < 36, we have 

A o R^{T") < A o R^{T') < A o Rr,-3s{T"). 

Thus 

A o R^{T) -8dC <Ao R^{T') < A o RnssiT) + 85C. 
Since lim^j^g ^°Rri-35iT) = Aoi?^(T), this suffices to establish the result. □ 

An alternative algorithm for simulating from the distribution in (|7.1() in 
the case of i.i.d. uniform sampling is the complete graph special case of 
Wilson's loop-erased walk algorithm for generating a uniform spanning tree 
of a graph [PWQSf fWilQG , WP96j . Asymptotics of the latter algorithm have 
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been investigated in |Pitn2bj . Wilson's algorithm was also used in |PE,n4j 
to show that the finite-dimensional distributions of the re-scaled uniform 
random spanning tree for the d-dimensional discrete torus converges to the 
Brownian CRT as the number of vertices goes to infinity when d > 5. 

8. Rayleigh process 

Suppose that we take the root growth with re-grafting process {Xt)t>o 
under P"'" for some T G 'proot^ point x G T, and we denote by 

Rt the distance between x and the root t of Xt (that is, Rt is the height 
of X in Xt). According to the root growth with re-grafting dynamics, Rt 
grows deterministically with unit speed between cut-time r for which the 
corresponding cut-point falls on the arc [t,x]. Such cut-times r come along 
at intensity Rt- dt in time, and at r— the position of the corresponding cut- 
point is uniformly distributed on the arc [r, x] conditional on the past up to 
r— , so that Rr is uniformly distributed on [0,/?,-- ] conditional on the past 
up to r— . Consequently, the R"''-valued process {Rt)t>o is autonomously 
Markovian. In particular, {Rt)t>o is an example of the class of piecewise 
deterministic Markov processes discussed in the Introduction. 

In order to describe the properties of {Rt)t>o, we need the following defi- 
nitions. A non- negative random variable R is said to have standard Rayleigh 
distribution if it is distributed as the length of a standard normal vector in 
R^, that is, 

(8.1) P{i2 > r} = exp ^-y^ , r > 0. 

If R* is distributed according to the size-biased standard Rayleigh distribu- 
tion, that is, 

(8.2) nR*edr}='^i§^ = ^r^e-'^^'dr, r > 0, 

and if [/ is a uniform random variable that is independent of R* , then UR* 
has the inverse size-biased standard Rayleigh distribution: 

(8.3) P{C/i?*edr} = ^-^^|^|^ = y|e-^'dr, r > 0. 

Thus R* and U R* are distributed as the length of a standard normal vector 
in and M, respectively. 

For reasons that are apparent from ProDosition l8.1l below. we call the pro- 
cess {Rt)t>o the Rayleigh process. We note that there is a body of literature 
on stationary processes with Rayleigh one-dimensional marginal distribu- 
tions that arise as the length process of a vector-valued process in with 
coordinate processes that are independent copies of some stationary centered 
Gaussian process (see, for example, |Has7ni llVfRR58l lRSn2] V 
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Proposition 8.1. Consider the Rayleigh process {Rt)t>o- Write P*^' for the 
law of {Rt)t>o started at r >0. 

(i) The unique stationary distribution of the Rayleigh process is the stan- 
dard Rayleigh distribution and the total variation distance between 
P^{Rt € •} and the standard Rayleigh distribution converges to as 
t — ^ oo. 

(ii) Under P^, for each fixed t > 0, Rt has the same law as RAt, where 
R has the standard Rayleigh distribution. 

(iii) For x > 0, the mean return time to x is x^^e^^ . 

(iv) IfTn denotes the nth jump time of {Rt)t>o, then as n ^ oo the triple 
{Rt„, Rt„+i) converges in law to the triple {U'R* , R* , U" R*), 
where U' and U" are independent uniform random variables on ]0,1[ 
independent of R* , and R* has the size-biased Rayleigh distribution. 

(v) The jump counting process N{t) := |{n G N : t„ < t}| has asymptot- 
ically stationary increments under P^ for any r >0, and 

(8.4) l7V(t)^y|, p--a.s. 

as t ^ oo. 

Proof, (i) Let fl be a Poisson point process in MxM_|_ witii Lebesgue intensity. 
For — oo < t < oo let 

(8.5) Rt := inf{x + {t - s) : {s, x) £U, s< t}. 

It is clear that (Rt)t£R is a stationary Markov process with the transition 
dynamics of the Rayleigh process. Similarly, for r G R+ and t > 0, set 

(8.6) Rl = {r + t)A mi{x + (t - s) : {s, x) eU,0<s< t}. 

Then {Rl)t>o has the same law as the Rayleigh process under P^. 

Note that the event {Rt > r} is the event that n has no points in the 
triangle with vertices (t — r, 0), (t, 0), (t, r) and area Thus P{-R( > 

r} = exp(— r^/2) and the standard Rayleigh distribution is a stationary 
distribution for the Rayleigh process. 

Let T'' := mi{t > : R^ = Rt}. Note that Rl = Rt for all t > T\ 
Note that T^ > t if and only if either Rq > r and 11 puts no points into the 
quadrilateral with vertices (0,0), (t, 0), (t, r + t), (0, r), or < ^ and fl 
puts no points into the quadrilateral with vertices (0,0), (t, 0), {t,Ro + t), 
{0,Ro). Hence 

P{r'' >t} = exp (-y) exp ^-^(r + (r + t))t 

+ J exp I --(x + (x + t))i 1 xexp ( — —j dx. 

By the standard coupling inequality, the total variation between P^'{Rt G •} 
and f{Rt G ■} is at most 2P{T^ > t}, which converges to as t ^ oo. 
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This certainly shows that the standard Rayleigh distribution is the unique 
stationary distribution. 

(ii) Note that > x if and only iir + t>t>x and there are no points 
of n in the triangle with vertices (t — x,0), (t, 0), {t,x) of area or r + 
t > X > t and there are not points of n in the quadrilateral with vertices 



(0,0),(t,0),(t,2;),(0,x 
either case, 



t) of area ((x - t) + x)t/2 = x^/2 



tY/2. In 



V^'iRt > x} = l{r + t > xjexp ( -^x^ + ^((x -t)^ 



Taking r = gives the result, 
(iii) Let Ty := inf{t > Q : Rt 



y}. It is obvious from the Poisson con- 



struction that, for all X > and y > 0, P^{0 < Ty < oo} = 1 and 
P^[exp(uTy)] < oo for all u in some neighbourhood of 0. The Laplace 
transforms P^[exp— ATy] are determined by standard methods of renewal 
theory: 

C/x(A) 



(8.9) 
where by (|8?8|) . 

(8.10) 



[exp -AT, 



dt e 



' l + f/x(A)' 
,,P^{Rtedx} 



A > 0, 



dt e"^* t e 



dx 



+ F{R e dx}- 



-Xx 



A 



In particular, it follows easily that the mean return time of state x 



i.ll) 



P^'[r,] = -hmip^'[exp-Arj 



is the inverse of the density of -R at x, that is x ^e2^ , as claimed, 
(iv) Let T := inf{t > : Rt Rt-}- By part (i), the joint distribution 
of (Rr^, Rr„^j^-, Rt„^-^) converges to the joint distribution of (Rq, Rf-, Rf) 
conditional on 7^ Ro~ ■ Let C denote the intensity of the stationary point 
process {t £ R : Rt Rt-} ■ Then 

P{^0 G dx, Rf^ G dy, Rf£dz\Ro^ Ro-} 



^.12) 



exp 



1 



dx exp 



1 



(x + y)(y-x) dydz 







— dx 


X 


y 





1 . 







X 


- dz 




y 



2 

C^^y^exp ( -^y^ 1 dy 
for X < y and z < y. The result now follows from (|8.2|) . which also identifies 
(v) The stationary point process {t G 



: Rt 7^ Rt-} is clearly ergodic by 
construction, and it has intensity yvr/2 from the argument in part (iv). For 
any r > 0, it follows from the argument in part (i) that i?[ = Rt for all t 
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sufficiently large, and so the result follows from the ergodic theorem applied 

to {t £R: Rt^ Rt-}. □ 

Then the following corollary is a consequence of Proposition 17.11 See 
|DGR02j ■ where similar scaling limits are derived. 

Corollary 8.2. For each N e N, let (fif ) t>o denote a continuous time 
Markov chain with state space {!,..., A^} and infinitesimal generator matrix 



[ l/N, 1 < i < i - 1, 

-(iV-l)/iV, j = i, 

{N - i)/N, j = i + l, 

0, otherwise. 



(8.13) Q^{i,j) :-- 

Write p^'**^ r € {1,...,A^}, for the corresponding family of laws. If a 



sequence (r^jN^fq, r^ € {1,...,A^}, is such that lim7v->oo-^ ^^'^^^ = r, 
exists, then the law of N~~^/'^ (r^^^ under P^'^^ converges to that of 



tVN 



the Rayleigh process {Rt)t>o under F^°° in the usual sense of convergence of 
cddldg processes with the Skorohod topology. 
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